AGCTGTGGTA CCATCATTTC TGTATAACCA TGTTGTGTTG TATGTTTTGT AATCATATAG 2640

TTCATTAAAG CACGCTCTAA TTGCGCACCT TCATTTGTTA AATATACAAA ACGCGCACCT 2700 

GAAACTTTTG CTGCACGATC AAAATCAGCC ATTTTCAATT CTTCTACAS ATCCCAATGT 2760

GCTTTGGGTT CAAATGAAAA CTCaCGTGGT GTACCCCACT TTTTAACTTC AACGTTATCT 2820 

TCATCAGATT CACCTTGAGG TACATCATCA CTTATTAAAT TTGGAATACG ACAAAGGATA 2880

CCTGTCATTT TATTATCAAT TTCATTTAAT TGACTATCTT TTTCTTTAAT ATCGTCACCT 2940

AATGTGCGCA TTTCAGCAAT CACATCATCA GCATTTTCTT TATTACGTTT TTTTAATGCG 3000 

ATTTCTTCGC TTACTTTATT ACGACGTGCT TTCATTTCTT CTGTTGCACT AATTAATTTA 3060

CGTCGTTGCT CATCCAATTC TAAAATTTCA TCTACAACTT TTGGATCATC TCCACGTAAT 3120

TCAATTTTGC TCTTAACTGT GTCAGGCTCA TTTCTGAATA ATCTAATGTC TAACATTAAC 3180 

CTTCATCCTT TCCCAAATAA TTATCATTTA TTATGGAATG ACGTACGTCT TTATTTTTTA 3240

GAAAATAAAA AAAGACCACA TCCCTACAAG GGACGTGGTC TACGCGTTGC CACCCTATTT 3300

AACAATTTAA GTTATAAAGA TACACTAAAC CTAAATTGCA CTTCACTAAA ATAACfETTA 3360

TCACCGATTG TTCTTTTAAA TTAAGTAGGT AGATTCATAT ATATGTTGAT TCTTGTTCAC 3420

ACTAACCACA AGCTCTCTGA TATCGAACAC TATATATTAC TTGTCCTACG AACAATGTCT 3480

TATTAAGTTA TTTTTAATAT AGCAAACTAT ATTTGCTTTT TCAAGTAACG ATTTCAAACA 3540

TCACTCATGT CGATTTAGTG ACATGCAGTC GTTTGATAAA TTGATTGCTT TAAATACTGT 3600 

GCAACCGCTT CAATATCTTT ATGAAATTGA CGATCATGTG TAATGGATGG CACGATACTT 3660

CGAAACTCAT CATACTTGCG ACGTGTTTTT GGTGATAATC CTTCAACACC TTTTAACTCT 3720 

GCTGCTTGTA ATGCAATAAC ACATTCGATT GCCAGCA&C GTCTTGCATT TTCAATAATT 3780

TGATAACCAT GTCTAGCAGC TGTAGTTCCC ATAGATACGT GATCTTCTTG GTTCGCAGAT 3840

GAAGTGATAG AATCAACACT CGCTGGATGC GCTAAAGTTT TATTTTCAGA AACGAGACTT 3900 

GCAGCAGCAT ATTGCATAAT CATCGCGCCA CTTTGCAATC CTGGCTCTGG ACTAAGAAAT 3960

GCTGGTAAAT CACCATTTAA TTGAGGATTT ACTAGTCGCT CTAGACGACG TTCCGATACG 4020

TTTGCTAATT CACTTACACC TAATTTAAGA TGATCTAATG CAAAAGCAAT AGGTTGTCCA 4080

TGGAAGTTAC CACCTGAAAT AACAAACGTT TCATTTGCTT CCTCAAATAT AAGTGGATTA 4140 

TCATTAGCCG CATTCATTTC AAATTCTAAT TGCTGTTTAA CATAATTGAA TACTTGAAAA 4200 

CTCGCGCCAT GGATTTGTGG TATACAACGC AACGTATATG CATCTTGTAC ACGTATTTCT 4260

GATTGTCGCG TCGTTAATGT TGATCCTTCT AACCAATCAC GCATACGCGC TGCCACATTA 4320

ATCTGTTCTT GAAAATTACG AACTGCGTGC ACATCATGTC GATAECATC TATAATGCCA 4380

TTAAGAGACT GATGCGTTAA TGCAGCAATC CATTCAGATT GGTAACCTAA ATCTTCTGCT 4440

TCTATATARC TAATGACACC TTGAGCTGTC ATAGCTTGCG TACCATTAAT CAATGCTAAA 4500

CCTTCTTTAG CCTGAAGGTT CAAAGGTTGT CTATTTAATT CTCTTAATAC ATCGTCACTA 4 5 60 

TCCTTTTCTT CCCCTCTGTA CAATACTTTC CCTTCACCAA TTAATGCTAA TGCTAAATGT 4 620 

GATAATGGCG CTAAATCTCC TGATGCACCG AGAGAGCCTT GCTGTGGGAT TATCGGTATA 4 680 

ATACGTTCAT TTATAAAAAA TTGTAATTGT CTCACTAATT CTAAAGTGGC ACCTGAATGA 4740

CCTTTTAATA ATGTATTCAA TCGTAASkTC ATCATGACTA ATGCTACTTC TTTTGAAAAT 4800 

GGCTCACCTA GTCCACAGGC ATGTGAGCGT ATCAGATTCA CTTGTAATTC ATTATATTGC 4860

GTCGGGTCTA TACGTACATC ACTAAATAAC CCAAAACCTG TAGTAATACC GTAAACCGTT 4 920 

TCCTCATTTT CAATAATACG TTCAACTACC GCTCTACTTT TTTTGACACG TCTAACGCA 4 980 

TCATCAATAA TTTCAATCTT TGATTGTTGT TGTAAAAATG ATTTAATATC CTCAATTGTT 5 04 0 

AGTGTTTCAC CATCTAAATA TAAAGTCATA TATGTTACCC CCTTGTTTAT ATTAAGTAAC 5100 

CCATCCTTCT TGAAGTATAC GTTTTCATTT TTATTGAAAC AATGGTTTTA CGTACATTTA 5160 

TAACCTATTA TCAGAGCACT ATTGTAGTGC GTTAAAGGAT ATTAAGATTG TTGTAAGCAT 5220 

ATTTAATAAT TTATCTATTG ACGAATTGCA TATACAGGTA TAGTATTTTC TATTGTATTT 5280 

AACGACAAAT AATAATGAAT TCAGAAATTT ATAATACATT TTGTTAAAAG TTACTATATA 5340

TTTTTAAAAT TGAATAAATT CGGAAAAGGC TTTACATGG GAGGTTATAT CACTATGGAA 5400

ACGTTAAATT CTATTAACAT TCCTAAGCGT AAAGAAGATT CACATAAAGG TGATTATGGC 54 60 

AAAATTTTAT TAATTGGTGG ATCTGCTAAC TTAGGTGGTG CCATTATGTT AGCGGCTCGT 5 520 

GCATGTGTAT TTAGCGGTAG TGGTTTAATC ACTGTAGCTA CACATCCAAC AAATCATTCA 5580 

GCATTACATT CTCGTTGCCC AGAAGCGATG GTTATTGATA TTAATGATAC GAAAATGTTG 5 64 0 

ACGAAAATGA TTGAAATGAC TGACAGTATA CTAATTGGTC CAGGTCTTGG CGTTGATTTC 5 7 00 

AAAGGAAATA ATGCCATTAC ATTCCTACTA CAAAATATAC AACCGCATCA AAATTTAATC 57 60 

GTAGACGGCG ATGCGATTAC AATCTTTAGT AAACTGAAAC CGCAATTACC TACATGTCGT 58 2 0 

GTGATCTTTA CACCACACCT CAAAGAATGG GAACGATTAA GTGGTATTCC TATTGAGGAA 5880 

CAGACATATG AGCGTAATCG TGAAGCAGTT GATCGTTTAG GTGCAACTGT TGTACTTAAA 5940

AAACATGGTA CTGAAATTTT CTTTAAAGAT GAAGACTTTA ATTGACAAT CGGTAGCCCA 6000 

GCAATGGCGA CTGGTGGTAT GGGCGATACA CTTGCTGGTA TGATTACAAG CTTTGTCGGT 6060

CAATTTGATA ACTTAAAAGA AGCGGTTATG AGTGCCACAT ATACACATAG TTTTATTGGC 6120 

GAAAACCTTG CAAAAGATAT GTATGTGGTG CCACCATCAA GACTTATCAA TGAAATACCT 6180

TACGCAATGA AACAATTAGA AAGTTAGTCA TTACTAATCA TTGAATATAG TAAAGCATTA 6240

CTTTCTAGCA TAAAAATAAG ACTCCCCTAC ATATAGGGAA GTCTTATTTT TTATTATTCT 6300

TCATCTGATG ATTGTTGTAT ATCTTCTTCA ACACGATCCA TGAAATCTTG TCTTACTTCA 63 60 

ATACGTCCAT CTTCATCATT TTCTTCTGAA TCAATCACTT CAGTATGAAT TGCATTTCCT 64 2 0 

GGTGTTTCAT CATTTaCAAC CGCTTCACGT TGTTGTTCAG TACCATCTTC AGATACAGTT 6480

GAAGTAGATT GCTCATCTTC ATTCGTTTCA TCTTCTGCAT CTTCTTTTAC TTTAGCAACC 6540

GTTGAAACAA ATTGATCATC ACCTAAGCGA ATTAAGCGAA CACCTTGTG TGCACGACCA 6600 

TTTTGAGAAA TATCTGCAAC ATCTAGTCGA ATAATGACAC CTGCATTAGT AACAATCATT 6660 

AAATCTTCTT CACCAGTTAC TGTAGTGATA CATACAACAT TACCATTACG CTCAGTAATC 6720 

GTAGCTGTTT TAATACCTTT ACCACCACGA TTTGATAAGC GATAGTCATT AACTGGCGTA 67 8 0 

CGTTTACCAT AACCATTTTC AGTAACTACT AATACTTCAT CAACACTGTT TGCATGAGCT 6840

ACATCAAGCC CTACAACTTC GTCACCTTCA CGAAGTGTAA TACCTTTCAC ACCCGTTGCT 6900 

GTACGGCCTA AAGGACGTAA TGTTGATTCA GGGAATCGAA TTAATGATGC ATGTGATGTA 6960 

CCAATCAAGA TATCTTCTTG ACCACTTGTT AAGCGAACTG CAATTAACTC ATCATCTTCT 7 02 0 

CTGAACGAAA TCGCAATCTT ACCATTTCTA TTTATTCTTG AGAAGTTACT TAATGCTGAA 7080

CGTTTAACGA CACCACGTTT AGTTGCAAAC ACTAAGAAGT TGTCTTCACT TTCAAGGTCT 714 0 

TTAACAGCAA TCATTGTACT AATGACTTCA TCATTTTCAA GTTCAATAGC ATTCAKACA 7200 

GGAATACCTT TAGACTGTCT TGATAACTCA GGCACTTCGT AACCTTTAAG TTTGTATACA 72 60 

CGACCTTTGT TAGTAAAGAA CAATACATGG TCATGTGTAC TTAAAGTTAC CAATTGACTG 7 32 0 

ACAAAATCTT CTTCCAATGT ATTCATACCT TGAACACCAC GACCACCACG GTTTTGAGCA 7380

CGATATGTAG ATACCGGCAA ACGTTTAATG TAGTTATTAT GGCTTAGTGT AATTACTATT 74 4 0 

TGTTCTTCTG GAATTAAGTC TTCGTCCTCT AAGTCTTCAA ATCCACCTAA TTGAATTTCT 7500

GTACGACGAT CATCACCGAA ACGATCTCTA ATTTCAGTCA ATTCATCTCT AACTAACTGT 7 5 60 

AATAACACTT CTTCATCAGC TAAGATTGCT TCTAATT&C TAATATAATT TAATAACTCA 7 62 0 

TTATATTCAG CTTCAATTTT GTCTCTCTCT AAACCTGTTA GACGTCTTAA ACGCATGTCT 7 68 0 

AAAATAGCTT GAGCTTGTTT TTCAGAAAGT TTGAAGCGTT GTTGCAAGCT TTCCATTGCA 77 4 0 

ACTTTATCTG TATCTGACTC ACGAATCGTT GAAATAATTT CATCGATATG GTCAAGTGCG 7 8 00 

ATACGTAATC CTTCTAAAAT GTGGGCACGA TCTTTAGCTT TACGTAAgTT GTATTGCGTA 7 8 60 

CGTCTTCTAA CAACTGTCTT TTGATGCTCT AAATAATGTA CCAACGCTTC TTTTAAATTA 7 920 

ATAAGCTTCG GTCTACCATT TACAAGTGCA ATCATATTCA CACCAAATGA TGTTTGAAGA 7 98 0 

GGTGTTTGTT TGTATAAGTT ATTTAAAATG ACACTAGCAT TTGCATCCTT ACGCACATCA 8 04 0 

ATAACGACAC GCACACCAGT ACGTAAACTT GTTTCATCAC GTAAATCAGT GATACCGTCA 8100

ATTTTCTTGT CACGAACGAG CTCTGCAATT TTTTCAATCA TACGAGCCTT ATTCACTTGG 8160

AAAGGAATTT CAGTGACAAC AATACGTTGA CGTCCGCCTC CACGTCTTC AATAACTGCA 8220

CGAGAACGCA TTTGAATTGA ACCACGACCT GTTTCATATG CACGTCTAAT ACCACTCTTA 8280 

CCTAAAATAA GTCCAGCAGT TGGGAAATCA GGACCTTCAA TATCCTCCAT TAACTCAGCA 834 0 

ATTGAAATAT CAGGGTTCTT ACTTAAGCTA AGTACACCAT TGATTAATTC TGTTAAGTTA 8400

TGTGGTGGAA TATTCGTTGC CATACCTACC GCGATACCTG ATGCACCATT GGCTAATAAG 8 4 60 

TTAGGGAATC GAGCAGGTAA GACTGACGGC TCTCTTTCAT TACCATCATA GTTATCGATA 8520 

AAATCTATTG TATCTTTATT AATATCACGT AACAGTTCAA GTGTGATTTT AGTCATACGC 8580 

GCTTCAGTAT AACGCATTGC TGCTGCGTCA TCTCCATCCA TTGAACCAAA GTTACCTTGG 8640

CCATCAACAA GCGGATAACG ATAACTGAAA TCTTGAGCCA TACGTACCAT TGCTTCATAA 8700

ATAGATGAGT CACCATGAGG GTGATATTTA CCCATTACGT CACCAACGAT ACGTGCTGAT 8760

TTTTTATATG ATTTATCCGG TGTCATACCT TGTTCATTTA ATCCATATAG TEACGACGA 8820

TGTACTGGTT TTAAACCGTC ACGAACATCT GGCAATGCAC GAGCAACGAT AACACTCATC 8880

GCATAATCTA AAAATGATTC ACGCATTTCA CTGGTAATAT TTCGTTCATT TATTCTTGAT 8 94 0 

TGAGGTAATT CAGCCATCAA GAGTTCCTCC TTCAAAAGTT CAGTTCACAG CGCTTAGAAG 9000 

TCTAAGTTTG CATAAACTGC ATTATCTTCT ATAAATTGTC TACGGTTTTC TACAACGTCA 9060

CCCATTAACA TTTCAAATGT TTGGTCCGCT TCAATCGCAT CTTCAAGTTT TACTTGTAAA 9120 

AGAGCGCGGT GCTCAGGGTT CATTGTTGTT TCCCAtAATT GATCTGCATT CATTTCTCCA 918 0 

AGACCTTTGT ATCGTGCAAT AGACCATTTT GGTCTTGGAT TCAATTCAGA TTTAAGTTTA 924 0 

TCAAGTTCCC TATCATTGTA TACATAATAC TTTTGTTTAC CTTGTGTCAG TTTATACAAC 9300 

GGTGGCTGTG CAATATACAC ATAGCCTGCT TCAATTAACG GTCTCATAAA TCGATAGAAG 93 60 

AATGTTAATA ACAATGTTCT AATATGCGCT CCATCCACAT CGGCATCAGT CATAATGACG 94 2 0 

ATTTTGTGAT ATCTTGCTTT CGCTAGATCA AAGTCGCCAC CGATTCCTGT ACCAAATGCT 94 8 0 

GTGATCATTT GACGAATTTC ATTGTTATTC AAAATTCTAT CTAATCGTGC TTTTTCAACA 954 0 

TTTAATATCT TACCTCGTAA TGGTAAAATC GCCTGCGTTC TAGAGTCACG ACCAGATTTT 9 600 

GTAGACCCCC CGGCAGAGTC CCCTTCGACT AAGAAAATCT CACATTCTTC AGGACTTTTA 9660 

CTAGAGCAAT CGGCTAATTT ACCTGGAAGG CTTGCTACAT CTAACGCTGA TTTACGACGT 97 2 0 

GTTACTTCAC GCGCTTTTTT CGCAGCAACA CGTGCACGTG CCGCCATAAT ACCTTTTTCA 97 80 

ACCACTGTAC GTGCGACTTG TGGATTTTCA TATAAAAATC (FTCAAAGTG CTCTGAGAAT 9840

AATTTATCTA CAACTTGACG CACTTCAGAA TTACCTAATT TTGTCTTCGT TTGACCTTCG 9900

AATTGAGGAT CACCATGTTT GATAGATATA ATTGCTGTCA TACCTTCACG TGTATCTTCA 9960

CCAGAAAGTC TATCTTTTTC TTCTTTCATA ATCTTGCTAC TTAAACCATA ACTATTTAAG 10020

ACACGCGTTA ATGCACGTTT GAATCCGTCT TCATGCGTAC CACCTTCATA CGTATGAATG 1008 0 

TTATTTGCGT AAGTTAAAAG ATTTGTGGCA TATCCTGAGT TATATTGAAT CGCAATTTCT 1014 0 

ACTTCAATAT CATCTTTAGA TTGATGAATA TAAATTGGCT CATCATGAAT AGGTTCTTTA 10200 

TTTTCGTTCA ATAACTCAAC G17CGATTTA ATACCGCCCT CATAGTGATA GGAGTCTTCT 102 60 

CTAACGTTTT CTTCATCACG TTCATCTCTT AATGTGATTT GAATTCCTTT GTTTAAGAAA 10320 

GCAAGCTCTC TAATACGCTG CTGTAATGTT TCATAGTTGT ATACAGTTGT CTCTGTGAAG 10380 

ATTTCTCCAT CTGCTTTAAA ACGAAtGaCA GTACCTGTCT TAtCAGTnCB GCCAACTTCT 10440 

TTTAAGTCAA ATTGAGGTAC ACCTTTTTTA TATGCTTGAT GATATATAGT CTCATTTCTG 10500 

TGTACATATA CTTCTAAGTC TTGTGACAAT GCGTTTACAA CTGATGAACC AACACCATGT 10560

AAACCACCAG ATACTTTGTA TCCGCCACCG CCAAATTTAC CACCAGCATG TAAAACAGTT 10620

AAAATAACTT CGACAGCTGG ACGTCCCATT TTTTCTTGAA TATCAACTGG GATACCACGT 10 68 0 

CCGTTATCCG TTACTTTAAT CCAGTTATCT TTTTCAATAA CAACTTCAAT TTGATTTGCA 107 4 0 

TAACCaGCTA ATGCTTCATC GATACTATTA TCGACAATTT CCCACACTAA ATGGTGCAAA 108 00 

CCTCTCTCTG AAGTCGATCC TATATACATA CCTGGTCTTT TACGTACTGC TTCTAAACCT 10860 

TCTAATACTT GTATTTGCCC AGCACCATAA TTATCCGTGT TGTTTACATC TGACAATGCA 10 920 

GTCACCATCG CTTTCTGTTA CTTTATAATT TCACCTTGAT TAATACGATA CAATTTAGCG 1098 0 

TTATTCATGA TTTCATGATC AATACCATCT ACAGATGTCG TAGTGACAAA TGTTTEACT 11040

TTATGCTGAA TCGTACTTAA TAAATGCGTT TGACGCGAAT CATCTAATTC ACTGAGTACA 11100 

TCGTCTAATA ATAAGATGGG ATATTCCCCA ACTTCGATAT TCATTAACTC AATTTCAGCT 11160

AATTTAATGG ACAAAGCCGT TGTACGTTGC TGTCCTTGAG AACCATATGT TTGAGCATCC 11220 

ATGCCATTCA CATCAAAACT TATATCATCT CGATGTGGTC CGAATAAGCT AATGCCTCGT 112 8 0 

TCTTTTTCTC TTTGCATATT ATCGCTAAGA ATAGACATAA TTTCTTCAAG TCGTGCCGCT 11340 

TCATTTTGAG CATAATCAAA TTTAAGACTA GGTAAATAAT TCAGCGACAA CGCTTCTTTA 114 00 

TCATTTGTGA TACCAGCATG AATCGGTTTA GCTAACGKT CTAGCTCTTG AATAAAATGT 114 60 

GCACGTTTAT CAGTTACTTT CATTGCATAT TCAGCAAACT GCTGATTTAA TACTTCCAAC 11520 

ATTGTTAAGT CCTTTTTTTG GCCTAATTGT AACTGCTTTA AGTAATTATT CTTTTGCTTT 11580 

AAAATACGTT GGTATTGAGC TAAATCATTT AAGTAAACAG CAGAAATTTG GCCCAACTCC 11640

ATATCTATAA AGCGTCGTCT TATTtGrGGr GAGCCTTTTA CAATATTCAA ATCTTCTGGC 11700

GCAAATAGAA CCACATTGAG GTGTCCAATA TATTGAGTTA GACGACTTTG CTCTAAGTGn 11760

ATTCACTTTG GACTTGTTTA CCTTTnTTAG TTATAAACAT TGTTAATGGG CATCGTGCCG 11820



TGT 11823 
(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
ATAATTATTA ACATGGTGTG TTTAGAAGTT ATCCACGGCT GTTATTTTTG TGTATAACTT 60 

AAAAATTTAA GAAAGATGGA GTAAATTTAT GTCGGAAAAA GAAATTTGGG AAAAAGTGCT 120 

TGAAATTGCT CAAGAAAAAT TATCAGCTGT AAGTTACTCA STTTCCTAA AAGATACTGA 180

GCTTTACACG ATTAAAGATG GTGAAGCTAT CGTATTATCG AGTATTCCTT TTAATGCAAA 240

TTGGTTAAAT CAACAATATG CTGAAATTAT CCAAGCAATC TTATTTGATG TTGTAGGCTA 300

TGAAGTTAAA CCTCACTTTA TTACTACTGA AGAATTAGCA AATTATAGTA ATAATGAAAC 360

TGCTACTCCA AAAGAAACAA CAAAACCTTC TACTGAAACA ACTGAGGATA ATCATGTGCT 420

TGGTAGAGAG CAATTCAATG CCCATAACAC ATTTGACACT TTTGTAATCG GACCCGGTAA 480

CCGCTTTCCA CATGCAGCGA GTTTAGCTGT GGCCGAAGCA CCAGCCAAAG CGTACAATCC 540

mTTATTTATC TATGGAGGTG TTGStTTAGG aAAAACCCAT TTAATGCATG CCATTGGTCA 600

TCATGTTTTA GATAATAATC CAGATGCCAA AGTGATTTAC ACATCAAGTG AAAAATTCAC 660 

AAATGAATTT ATTAAATCAA TTCGTGATAA nA 692 
(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

ATACTGTAGC GCAAATTTCA CAATGGCATG TTATAGAAGA TTTETTACG AATGAATTAG 60

GTATTAGTAT TTTACCAACA TCAATTTCAG AgCAACTAAA TGGAGATGTG AAGCTGtACG 120

CATTGAAGAT GCTCATGTAC ATTGGGAATT AGGTGTTGTT TGGAAGAAGG ATAAACAATT 180

AAGTCATGCC ACAACGAAAT GGATAGAATT TTTGAAAGAC CGTTTAGGCT AACATATTAA 240

TAAAGCACTC ATTATTTAAG GCGCATCATT ACGTGGGTCA TTGAAATAAT GAGTGTTTTT 300

TTGTGAAAAT GAAGTGAAAT TTAGAGAGCG TTTCCATAGA AAATAGTAAT ACAAACTATA 3 60 

AAAAAAGAGT ATTTTTATAT TGTGTACGCC ATCTTTATAA TAGTTATTGT AACAATTTAG 4 20 

ACATATTTAG AAAGGGATGG CGCCA7SCAC AAAGTCCAAT TAATAATCAA ACTACTACTA 4 80 

CAACTAGGAA TCATCATTGT GATTACTTAT AT TGGCACAG AAATTCAAAA GATTTTTCAT 54 0 

CTTCCCTTAG CCGGCAGTAT TGTTGGTCTA TTTTTATTTT ATTTACTATT ACAATTTAAG 600 

ATTGTACCGC TAACTTGGGT AGAAGACGGT GCAAACTTTT TATTAAAGAC STGGTCTTT 660 

TTCTTCATAC CGTCAGTTGT AGGtATTATG GaTGtgCTTC CGAAATTACG CTAAATTATA 720 

TACTCTTTTT CGCAGTCATT ATCATAGGAA CATGTATCGT TGCATTATCT TCAGGTTATA 780 

TTGCTGAAAA AATGTCyGtT AAACwTAAAC ATCGTAAAGG TGTAGACGCt TATGAATGAT 840 

TACGTGCAAG CCTTATTAAT GATTTTGTTG ACTGTCGTTT TATATTATTT CGCTAAAAGG 900 

TTACAACAAA AATATCCGAA CCCATTTTTG AATCCAGCAT TAATTGCATC TTTAGGAATT 960

ATTTTTGTCT TACTTATCTT TGGAATTAGT TATAACGGGT ATATGAAAGG TGGCAGTTGG 1020

ATCAACCATA TTTTAAACGC AACGGTCGTA TGTl TAGCGT ACCCACTTTA TAAAAATAGA 1080

GAGAAAATTA AAGACAATGT CTCTATCATT TTTGCAAGTG TATTAAcTGG CGTCATGCTG 1140

AATTTCATGT TAGTGTTCTT AACACTTAAA GCATTTGGCT ATTCTAAAGA CGTCATTGTA 1200

ACGTTATTGC CCCGATCTAT AACAGCCGCA GTAGGTATCG AAGTGTCACA TGAACTAGE 1260 

GGTACAGATA CGATGACCGT ACTTTTTATT ATCACAACGG GTTTAATCGG TAGTATTTTA 1320 

GGTTCGATGT TATTAAGATT TGGAAGATTT GAATCTTCTA TCGCCAAAGG ATTAACGTAT 1380 

GGGAATGCGT CACATGCATT TGGCACAGCT AAAGCACTAG AAATGGATAT TGAATCCGGT 14 4 0 

GCATTTAGTT CAATTGGGAT GATTTTAACT GCAGTTATTA GTTCAGTGTT AATACCTGTT 1500 

CTAATTTTAT TATTCTATTA ATTTAGATAT TTAAAATGAT AGACAGAAAG GGAGGCTATT 1560 

AGTAATAATG GCAAAAATAA AAGCAAATGA AGCATTAGTT AAAGCATTAC AAGCaTGGGA 1620 

TATAGATCAC TTGTATGGTA TTCCAGGAGA CTCAATCGAC GCATAGTCGA TAgTTTACGT 1680 

ACAGTGAGAG ATCAATTTAA ATTTTATCAT GTACGTCATG AAGAAGTAGC AAGCTTAGCG 17 4 0 

GCTGCTGGTT ACACAAAATT AACTGGTAAA ATCGGTGTGG CATTAAGTAT CGGTGGCCCT 1800 

GGTTTAATTC ATTTATTAAA TGGTATGTAT GATGCCAAAA TGGATAATGT ACCGCAATTA 1860

ATATTATCTG GACAAACGAA TAGTACAGCA CTTGGAACGA AAGCATTCCA AGAAACAAAT 1920

TTACAAAAAT TATGTGAAGA TGTAGCCGTT TATAATCACC AAATTGAAAA AGGTGACAAT 1980

GTGTTTGAAA TCGTTAACGA AGCAATTCGT ACGGCATATG AACAAAAAGG TGTAGCTGTT 204 0 

GTTATTTGTC CTAACGACTT ATTAACTGAA AAAATTAAAG ATACAACGAA TAAACCAGTA 2100

GATACATCAA GACCAACAGT AGTATCACCA AAATATAAAG ACATCAAAAA AGCGGTTAAA 2160

CTAATTAATA AAAGTAAAAA GCCTGTCATG TTAATTGGTG TAGGTGCGAA ACATGCGAAA 2220

GATGAGCTAC GTGAATTTAT TGAAATGGCT AAAATTCCTG T CAT T CATC ATTACCAGCT 2280 

AAAACAATCT TGCCGGATGA TCATCCATAT AGTATCGGtA ACTTAGGTAA AATCGGTACC 2 34 0 

AAAACATCTT ATCAAACAAT GCAGGAAGCG GATTTATTAA TTATGGTTGG TACAAACTAT 2 4 00 

CCATATGTGG ATTACTTACC TAAGAAAAAT ATTAAAGCCA TTCAAATTGA CACAAATCCT 2460

AAAAATATCG GACATCGTTT CAATATTAAT GTAGGAATTG TTGGAGATAG TAAAATTGCG 2520

TTGCATCAGT TAACTGAAAA TATTAAACAT GTTGCTGAAA GACCATTCTT AAACAAAACG 2580

TTAGAACGTA AAGCGGTTTG GGATAAATGG ATGGAACAAG ATAAAAATAA TAATAGTAAA 2640

CCATTACGTC CAGAACGATT AATGGCATCA ATCAATAAAT TTATTAAAGA TGATGCAGTG 2700

ATTTCAGCAG ATGTAGGTAC AGCAACAGTT TGGTCAACTC GATACTTAAA CCTTGGTGTA 2760

AATAACAAGT TCATCATTTC AAGTTGGTTA GGTACAATGG GTTGCGGTCT TCCAGGTGCA 2820 

ATTGCATCAA AAATTGCATA TCCAAATAGA CAAGCCATCG CAATTGCTGG TGACGTGCA 28 80 

TTCCAAATGG TAATGCAAGA CTTCGCTACA GCAGTACAAT ATGATTTACC TTTAACTGTA 2 94 0 

TTTGTACTTA ATAACAAACA GTTAGCATTT ATTAAATATG AACAACAAGC AGCTGGTGAA 3000 

TTAGAATATG CAGTTGATTT TTCTGATATG GATCATGCAA AATTTGCTGA GGCAGCAGGT 30 60 

GGTAAAGGTT ATACAATTAA GAGTGCTAGC GAAGTAGATG CTATAGTCGA AGAGGCATTA 3120 

GCACAAGATG TACCAACGAT TGTAGATGTA TATGTTGATC CTAATGCTGC GCCATTACCA 3180 

GGTAAAATTG TAAATGAAGA AGCGCTTGGT TATGGTAAGT GGGCATTTAG ATCAATTACT 32 4 0 

GAAGATAAAC ATTTAGATTT AGATCAAATT CCACCAA'TT CAGTGGCAGC AAAACGTTTC 3300

TTATAACTGA TTTAAAGGTT ATCACAATTG AATTGAACTA TAAAAACGGT AATTTCTATT 3360 

TCAACAAAAT GGGAATTGCC GTTTTGTTTA TTTATCACAA ATGATCGTAC TGAATTGATG 34 20 

ATAAAATTGT GAAAAAGTTG TTGAAAACGC TTTTACAAAT ATGTATAATA GCTATGAATT 3480 

AGATATCACT TGCGTGTTAC TGGTAATGCA GGCATGAGCA AACAACCGCA CTATGAGAAT 354 0 

AGTCTTGTTT GTTCATGCCT GCTTTTTTTG TACATGGAAG CGGAAATTGA GATAGGGGAT 3600 

GTTTATATGT TTAAGAAATT GTTTGGACAA TTGCAACGTA TCGGTAAAGC ATTAATGTTA 3660 

CCTGTTGCGA TTTTACCAGC AGCTGGTATT TTATTAGCGT TTGGTAACGC AATGCACAAC 3720 

GAACAATTAG TAGAAATTGC ACCATGGTTA AAAAACGATA TCATTGTAAT GATTTCGTCG 3780

GTCATGGAAG CAGCAGGACA AGTTGTATTT GATAACTTGC CATTATTATT TGCAGTTGGT 38 4 0 

ACAGCACTTG GATTAGCAGG AGGAGACGGT GTTGCAGCAT TAGfiGCGCT AGTAGGTTAC 3900

TTAATTATGA ATGCAACAAT GGGGAAAGTG TTGCACATTA CAATTGATGA CATTTTCTCA 3960

TATGCCAAAG GGGCAAAAGA ATTAAGTCAA GCAGCGAAAG AACCAGCACA TGCTTTAGTA 4 020 

TTAGGTATTC CAACGTTACA AACGGGTGTG TTTGGTGGTA TTATCATGGG TGCTTTAGCC 4 080 

GCATGGTGTT ACAACAAATT TTATAATATT ACACTACCAC CATTTTTAGG ATTCTTTGCA 414 0 

GGTAAACGAT TTGTACCGAT TGTGACATCG GTCGTAGCAA TCGCAACAGG TGTGCTTTTA 4 2 00 

AGCTTTGCGT GGCCACCAAT TCAAGATGGA TTAAATAGTT TATCGAATTT CTTATTAAAT 4 2 60 

AAAAATTTAA CATTAACAAC GTTTATATTC GGTATTATTG AACGCTCATT AATTCCATTT 4 32 0 

GGTTTACATC ATATTTTCTA TTCACCGTTC TGGTTTGAAT TCGGAAGTTA TACAAATCAC 4 380 

GCAGGTGAAT TGGTTCGTGG TGACCAACGT ATTTGGATGG CACAATTGAA AGATGGCGTA 44 4 0 

CCATTTACTG CTGGTGCATT TACTACTGGT AAATATCCAT TTATGATGTT GGTTTACCA 4 500 

GCGGCGGCAT TTGCTATTTA TAAAAATGCA CGACCAGAAC GTAAAAAAGT CGTGGGTGGT 4 5 60 

TTAATGTTAT CAGCAGGATT AACTGCATTT TTAACTGGTA TCACTGAGCC ATTAGAATTT 4 620 

TCATTCTTAT TTGTAGCACC AGTACTTTAT GGAATTCACG TATTATTAGC TGGTACATCA 4 680 

TTCTTAGTAA TGCATTTATT AGGCGTTAAA ATTGGTATGA CATTCTCAGG TGGTTTCATA 4 74 0 

GATTATATTT TATATGGTTT ATTAAACTGG GATCGTTCAC ACGCATTATT AGTTATTCCA 4 8 00 

GTCGGTATTG TATATGCTAT CGTGTATTAC TTCTTATTCG ACTTTGCAAT TCGTAAGTTT 4 8 60 

AAATTGAAAA CACCAGGTCG TGAAGATGAA GAACTGAAA TTCGTAACTC TAGTGTCGCA 4 92 0 

AAATTACCAT TTGATGTCTT AGATGCAATG GGTGGAAAAG AAAACATTAA ACATTTAGAT 4 980 

GCATGTATTA CACGTCTACG CGTAGAAGTG GTTGATAAAT CAAAAGTAGA TGTAGCAGGT 50 4 0 

ATTAAAGCTT TAGGCGCATC AGGTGTATTA GAAGTTGGAA ACAATATGCA AGCTATCTT 5100 

GGTCCAAAAT CAGATCAAAT TAAACATGAT ATGGCCAAGA TTATGAGTGG TGAAATTACG 5160

AAACCAAGTG AAACGACAGT GACTGAAGAA ATGTCAGATG AACCAGTTCA CGTAGAAGCA 5220 

CTTGGAACAA CAGACATCTA TGCACCAGGT ATCGGTCAAA TCATTCCATT ATCAGAAGTA 5280 

CCTGATCAAG TATTGGCTGG TAAAATGATG GGTGATGGTG TTGGCTTTAT CCCTGAAAAA 534 0 

GGTGAAATTG TAGCACCGTT TGATGGTACA GTGAAAACAA TCTTCCCTAC GAAACATGCG 54 00 

ATAGGATTAG AATCTGAAAG TGGCGTCGAA GTACTTATTC ATATTGGTAT CGATACAGTG 54 60 

AAACTGAATG GTGAAGGATT CGAAAGTCTG ATTAACGTTG ATGAAAAAGT AACACAAGGT 5520

CAACCATTAA TGAAAGTGAA TTTAGCATAC TTGAAAGCAC ACGCACCAAG CATCGTTACA 558 0 

CCAATGATTA TTACAAATCT TGAAAATAAA GAACTTGTCA TTGAAGATGT ACAAGATGCT 5 64 0 

GATCCAGGTA AGCTAATTAT GACAGTCAAA TAATGATTAA AAATGAAACA GCATATCAAA 5700

TGAATGAACT TTTAGTCATT CGTAGTGCGT ATGCGAAGTA GCGAGTTGAA AGAGAATACG 5760

TTACAAAAGG CAGTAGCTTA AAATGAAGCT ACTGCCTTTT TAGTGCGCAA TGATGTATAG 5820

CAGGTGTGTT GATGrTAATA AGTTAAATAT TAGTGTTAGA TATAGAAAAC ATTGCTTATG 5880 

TTTTTGTCAC ATTTTAGAAA AATGCATCTT CGCGACTAGC CAAATTAATA GTCTCATTGA 5940

AAAGTTTGAT TTTCCGTAAT AAAGTTCAAT TGCTTGTATT TTTATCATGA AAGTAATAAA 6000 

AATAAATTAA CATGATTTTA AATCTATTTG TAAGATAAGG AGATTTGTCA TTATGACAAC 6060 

AGAAGGTCTA TTAGTTGCAG AGAAAGAAAT CGAAGTGAAT GGTTACGBA TTGATGCGAT 6120 

GGGTGTCGTT AGTAATATCG TTTATATTAG ATGGTTCGAA GATTTGAGAA CAGCGTTTAT 6180 

TAATCAGCAC ATGAATTACT CAACAATGAT CAATCAAGGC ATTTCACCTA TACTTATGAA 62 4 0 

AACGGAAGCA GAGTATAAAG TACCTGTCAC AATACATGAC AAACCAGTAG GTCGTATTTA 6300 

CTTAGTTAAA GCAAGCAAGA TGAAATGGGT GTTTCAGTTT GAAATTGTGT CCGCACATGG 6360 

CGTGCATTGT ATTGGTACAC AGACAGGCGG TTTTTACAGA TTGAGTGATA AGAAGATAAC 6420 

CTCTGTGCCA CAAGTGTTTC AAGACATTTT AGCAACAAAA TAATGACTTC ATTTTAAAAT 64 8 0 

ATAAAAAGTA AGAAGGTGTT CGAAATGGTT AAGCAATTAA ATAGTGTCGA AGCATTCCGT 6540 

GAATTTATTC ATCAATATCC GTTAGCAGTT GTACATGTCA TGCGCGATCA GTGTAGCGTG 6600 

TGTCATGCCG TTTTACCACA AATTGAAGAC TTGATGCAAT CATATCCCAA TGTGCCATTA 6660 

GCTGTGATTA ATCAAAGTCA GGTGGAAGCT ATTGCTGGAG AATTAAATAT TTTCQTGTA 672 0 

CCTGTGGATT TAATTTTTAT GAATGGAAAA GAAATGCATC GTCAAGGGCG TTTTATCGAT 67 8 0 

ATGCAACGTT TTGAACATCA TCTTAAGCAA ATGAATGATA GTGTAAATAA CGATGTCGAT 68 4 0 

GAGCATTAAT ATCGCAAATG ATTAGCATTG CTAAGATTAT GTAGACATCA TAACTTATTT 6900 

CCCAGTAAAT ATTGGTAGTA ATTAGAATCA GCATGGTACA GTAGAACTAT AGTAGAAATC 6960

ATCAAAGAGG AGTGACGACA AATGCGTAAA AAATGGTCTA CACTTGCGTT TGGATTTTTA 7 02 0 

GTTGCAGCAT ACGCACATAT TAGAATTAAA GAAAAACGCA GTGTGAAAAG TTATATGTTA 7 080 

GAACAAGGTA TACGATTATC TAGAGCTAAG CGTCGTTTA TGTATAAAGA AGAAGCGATG 714 0 

AAAGCATTAG AAAAAATGGC GCCACAGACA GCAGGCGAAT ATGAGGGAAC CAATTATCAG 7200

TTTAAGATGC CAGTAAAAGT GGATAAGCAC TTCGGTTCAA CCGTTTATAC CGTTAACGAT 72 60 

AAACAAGATA AGCATCAACG CGTTGTATTA TATGCACATG GAGGCGCATG GTTCCAAGAC 7 32 0 

CCACTCAAAA TTCATTTCGA ATTTATTGAT GAACTTGCAG AAACACTCAA TGCTAAAGTC 7 38 0 

ATCATGCCAG TATATCCGAA GATTCCGCAT CAAGATTATC AAGCGACGTA TGTGCTTTTT 7440 

GAAAAGTTGT ACCATGATTT ATTGAATCAA GTAGCAGATT CTAAACAAAT CGTTGTAATG 7500

GGTGACTCTG CGGGCGGTCA AATTGCTTTA TCATTTGCTC AATTGTTAAA AGAAAAACAT 7560

ATTGTGCAAC CAGGACATAT TGTATTAATT TCACCAGTTT TAGATGCAAC GATGCAGCAT 7620

CCTGAAATTC CTGACTACTT AAAGAAAGAC CCAATGGTAG GTGTGGATGG CaGTGTGTTC 7680

TTAGCTGAAC AATGGGCAGG GGACACACCT TTAGATAACT ACAAGTATC ACCAATTAAT 7740

GGTGACTTAG ATGGTTTAGG TCGTATTACA TTAACTGTTG GTACAAAAGA AGTACTGTAC 7800

CCAGATGCTT TGAACTTATC GCAATTGTTG AGTGCGAAAG GTATCGAACA TGACTTTATA 7860

CCTGGATATT ACCAATTCCA TATTTATCCA GTATTTCCGA 7900
(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

GTCTAAATAA ACAAAATTAT CATTGATTaC TGAACTGGCA TTTCGAAGTA ATGCTTCAAT 60 

ATCATTCGAA TATTTCTTCA ATTTATGATT GTGAAATAAT TCTTGCATCA AAAATGGTCT 120

TTGGTCACAT GAATGTGCAT CTGAAGCTAC AAAATGAGCC AAATTACATT CTATAAATTG 180

TAATGATAAC TTTTGAATGT TTTTACCAAA TCCACCAACT AAAGAACTCG ATGTTAATTG 240

ACTCAGTGCC CCATTTGCAA CCAATTCATA TAATATTTCC GGATTTTTGG CGATACTTCT 300

ATTTCTTTCA GGATGTGCAA TGATTGGTAT GTAACCTCTC GATTGTATTT CAAAAAACAA 360 

TTGTTTTGTA TAATGTGGTA CTTCGCCCGT TGGAAATTCA ATT AAT AAT ATTTCGAACG 4 20 

ATTAATACCT TGAATACTAC CATTATCTAA GCCTTTCAGA ATCGAATCTG TAATTCTAAT 4 80 

TTCTTGCCCG GGAAATAATT TAATATCCAA TGCTTGAACT TCTGGATGCG TTCTTAACTC 540

CGCCAATTTC ACAAGCACTT GTTGAAATGT ATTATCATAT CTCGGATGCA AATGATGAGG 60 0 

TGTCGCTACA ATACTTGTTA CACCTTCATC CTTAGCTTGC TTTAATAGTG CAATACTCTT 660 

TTCAATTGTT TTAGGACCAT CATCTATATC AACTAATATA TGGTTATGAA TATCAATCAT 720

GATTCATCAG TCCCATAATA TGCATAGTAA CTAGCACTTT TATCTTTAGG CATTCTATTT 7 80 

AAGACTACAC CTAATAATTT AGCACCTGH GCTTCAATAA GTTCTTTTCC TTTTTTAACT 84 0 

TCATCTCTAT TATTATTTTC CGAATTAACT ACGTAGACAA CATTGCCGGT AAACTTTGAA 900 

AATAATTGCG CATCTGTAAC TGTGTTCACT GGTGGCGTAT CGATAATTAC AAAGTTATAA 960 

TTCATCAATA ATGTGTCATA CAAATTTGCA AATGCCCTTG ATGTAATTAA CTCGACGGA 1020 

TTCGGTGGGA TTGGCCCAGA CGTCAAGACG TCTAAATCTT GAATTTCAGT TGAGATAATA 1080

CTGTCTTGAT AAGTTGACCA ATTTAGCAAT AAACTTGATA GGCCTTCATT GTTTGGCAAA 1140

TTAAAAATAT AATGCTGCGT AGGTTTACGC ATATCCCCGT CTACGATTAG TGTTTTATAA 1200

CCTGCTTGCG CATATGCAAC TGCTAAATTT GCTGCAATTG TAGACTTACC TGCGCCTGGT 1260

GCCTCTGAAG TGATTACAAT GCTTTGAACT GCACTGTCAG GATTTGCAAA CATAATATTT 1320

GATCTTATGC CTCGAAATTT CTCGCTAATA GGTGACTTTG GTTGTTCATG GACAATTAAA 1380

CTTGATGTAC TTCyTCGTGT ATTCGTCATG GTAATTXTC GTAAATTAAA ATTTTTGTAT 1440

TGAACCTAAA ATAGGTAATC CTAGTTGCGA TTCAACATCT TCTTCTGTCT TAATACGCTT 1500

ATCTAATAAT TCTTTTAAGA AAATAATCAA TATTGCTAAA ACAATACCAA CAATAATGCT 1560

GATAACTAAG TTGACAGATA CTATTGGAGA TACTTTTACA GCATTATCAT GTGCTGAGGA 1620

AAGTATCGTA ACATTATCAA CACTCATAAT TTTAGGCATG TCATGAGCAA AAACTTTAGA 1680

TATTTTATTA ACAATTTTGT CAGATTCAGA TTTATTCCCA GTGGTAACTG ATACAGTAAT 1740

AATTTGAGAG TTTGTTTGAT TGGTTACTTT TAAAAATGAA TTCAACTCAG CTGTTGAATA 1800

CTGACCATCA AnTTCTCTAG ATACTTTATC TAGAATTCTA GGACTTTTGA TAATTTCCGT 1860

ATATGTATTA ACAGACTGCA AACTACTTTG AACATTTTGG AAAGCTAAAT CACTTGAGGA 1920

CTTTTTCATG TTCACTAATA TTTGAGTAGA AGCAGTATAT TTGTCAGGCA TAACAAAAAA 1980

GGTT 1984
(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6272 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

CAAATCCCTT GGTGATGAtA AAtGtATTGC TGTGTAGCCA AATAATCTTC GTATATATGA 60 

CTGACGTTCA ACAACAGCTT GCAATCGTTT CGTTGGTACA GTTACTTTCT TCTTGTTAAA 120 

GAGACCATAT TCAATTTTAA GTTGCTCATT TTCAAGCATC ACCGAAAAGC CATAAAATCT 180

TATCATTGTT ATAATCGTTC CAATAATATA TGCCACTATT AATACTAGTA AAATGATGAT 240

TAATACTGAA ATACTTACAA TTTGAACCCA TTGACTAATT TCATGATTTA GCTTCGACCA 300

TGGGATCAAC TCTCTTACAG CCCCGTAAAT CGGTACTAAA GCTGCTAACG TTACACCAAT 360

GGCGCCACTG GTCATTGCCA TAAATAGTGA TTCTTTAAAA TTCATCTGAT ATATAGGAAT 420 

GCGTTTATTT TTCTGATTAA GCATACTATC AGTGTTCTGC ACTTCATCTA AGCGACCTTC 4 80 

TGCGATGTCT TCCACATTAC CTTCAATGTC ATGATTACAG TTGTCATTCT TCTCAGCACT 540

AGACTTTTGC GCCACTTCTG TCTTCAACTC TGTTTGCAAT TGATCATAT ATCGTTCAAG 600

ATATTCACCT TGTTTTTTCG AAATAACACT TAAGACAATA CCATCACTTG GTGTTTTGAT 6 60 

TTGTAAATCC ACACCACCAA TAATTTGATT GACGACCCCT TGTGTCATAT CAACGGACTG 720 

AATACGTTTT ATATTTAATT CTTTACGCTT TTTATTAAAA ATACCTGTTG TTAAAATGAA 780 

ATAATTATCC tCAATCCAAT ATCGCGTGTT CATAATTCCG ACAATTTGAG AAATGTATGA 840 

T AT T AAAAAG AATACAAATA CAATACCTAT CCATAAATAT GATTCGGGAT TCGTATAATC 900 

AAAATCTTTC AATTGAAAGA TAATGAAAAT AAAAAAGACG ACTATGTTTT GTTTGATAGC 960 

ATT GAT TAT G CCAT TAAAAT ATGAAATCEG ATGTAATTTT TGAGGTTCAG AC AT C AC T T T 1020 

CAACCCCTCT CAAATTCGAC ATAGTTCTCT CTTCGATTAT TTTAACATCG T CAT G AGAC A 10 8 0 

TCATCGGTAA ATAAATAGTA TGACCTGCAG TCATAAATCC AACTTTATAC AAATTAAGCA 114 0 

CTTTACTAAT TGGATTAGAT TTAATCGACA AGTATTGTAA ACGTTCAATT CG21TCGTTT 1200 

CTTCTTTATA TATAAAAAAT GAT GT AC GAT ATTGTACACT TAGTTGATCA ACTTTATAAA 12 60 

AGCGACAATG ATATTGCCAT AAAGGCTTAA TAAATAATTT TAATGTACTC AGAGCACCTA 1320 

AAACCAACAA AATATAAAGT AAGTAATGTG GCCATTCAAA TCTTAACCAT ATAAAATAAA 138 0 

AAAT G AC AT A CACAGCTACA CTCAATATAA ATTCTAAGCC ATTCGTAATG TAGTAATACA 14 4 0 

ACAATGCTGA CTTAGGACTC TTAGTCAACT TAGTATAATC TGACATATAC CCCTCTCCCC 15 00 

AAATAAAAAA TTATACGGAT TTATAATCTA TTTCATTTTA TTTTTATATG AT G AT AAT T A 15 60 

TAGCATATGG AATATTTCAT GCTAATTTAT TCTTCTAAA GG T AC AT C T A AAAAT TTAAT 162 0 

T AAG C AG AAA GTGCTTGAAT TGCTAAAAAG ACACCATGTT ATAATTTTAT CAACATGATG 168 0 

CCTTTCATCT ATAATCAATC TTTCATCTTA TCAAGAGCGA TATTTAGTTC AAGC AC AT T C 17 4 0 

ACATAATCAT TTGTTAACAC ACCACGCTGC TTACGATGTT G AAT C AAG T C GGCCACTCTT 18 00 

GAAGTAGATA CATGACGAGC ATCAGCAATA CGAGGTGCTT GCTTCAATGC ATTTTCGACC 18 60 

GTAATATGCG GATCTAAGCC CGACCCAGAA CTTGTTGCAG CATCTATTGT TACATTTGAA 1920 

TTCCCAAATT TAACATGATG TTTCATGCGT GCTATTAATT CGGTGTTTCC ATTCGATTCA 198 0 

TTACTTCCAC CTGAAGATAC GCCGTTTTTA TATAATTTTT CAGGATTCAT ATTATAATCA 204 0 

ACTGCACTCG GTCTCCCGTG AAAAT ATCGT GTCTCTGTCC AGTGCTGTCC AATCAATTTT 2100 

GATCCAACTA TACGATTGTC ATACGTAATT AAACTGCCAT TTGCTTGTTG ATAAAAAAAT 2160 

ATTTGACCAA TTAACGTGAT AGCTAACGGG AATAAAAATC CEATAATAC CAT AG T T AT T 2 22 0 

ATCGT TAAAC AAATACTATT TCTTATCGTA TTCATGGTAC AGGCTCCTTC CTCTTTACAC 2280 

AAAAAATTGT ACAATCATAT CTATTAATTT AATGCCTAAA AACGGGACGA TTAATCCACC 234 0 

TAATCCATAA ATCAACATAT TATTTATAAA GATTCTATCA ATGCTGTAAC CCTTTACTTT 2400

TACACCTTTC ATGGCAATTG GAATTAAGGC AACAATGATT AATGCATTGA ATATCAAAGC 2460

TGATATAATT GCAGATTTAG GTGACGATAA ATGCATAATA TTTAGTGAAG TCATTTCTGG 2520

AATTGTTGAC ATCATTAGTG CAGGTAAAAT TGCAAAGTAT TTTGCTACGT CAT T AGCCAA 2 58 0 

ACTAAATGTC GTTAATGCAC CTCTCGTCAT TAATAATTGT TTGCCTATTT TTACAACCTC 2 64 0 

TATTAACTTT GTAGGATTCG AATCTAAATC AATTAGATTA GCTGCCTCTT TAGCACTAAT 27 00 

TGTCCCTGAG TTCATAGCTA ATCCTATATT CGCTTtGTGc tAGCGCAGGT GCATCATTTG 27 60 

TACCATCTCC TGTCATCGCA ACAATATGGC CTTTCGCTTG TTCATCTTTG ATGACTTTAA 2820

TTTTATCTTC GGGTTTACAC TCTGCAACAA ATCTATCAAC CCCGGCTTCT TTTGCAATTG 2880 

TAGCTGCTGT TAAAGCATTA TCACCTGTAC ACATAACTGT TTCAATCCCC ATTTTTCTCA 2 94 0 

ATTCAGTAAA TCGTTCTACA AGACCATCTT TAATCACATC TTTTAAATAA ATCACGCCAA 300 0 

GCATGACATT GTTTTCAATG ACTATTAAtG GnGTGCCACC TTTACTCGAT ACATCCATAC 3060 

AG AG AG AC T C AATATTAAGA GGAATATTGC CTTGTTGTTG TTTGACAAGA TTTATCATAC 3120 

TATTAGGTGC ACCTTTGAAT ACCGATATTT CATTTGTAAT GATTCCGCTC ATTCTAGTTT 3180 

CAGCTGTAAA AGGCTTATAT GTGCCATCAA TSTCTTTAGG CAGCTCATTT ATATACATcT 32 4 0 

GcttCGCTAA TCGTACAATA CTTTTTCCTT CTGGCGTATC ATCGTAGATT GATGACATAT 3300 

AAGCAGCGAC TATCAATTTT TCAAGCATTT GTTGATTCAC TGGTAAAAAT TCACTAGCGA 33 60 

TTCGATTGCC ATAAGTGATT GTGCCTGTCT TGTCTAAAAT CAT T AC AT C G ACATCTCAC 3420 

ATACTTCTAC AGCACGCCCA CTTTTCGCTA ATACATTGAA TTGAGTAACA CGATCCATGC 34 80 

CTGCAATACC AATCGCCGAT AACAAACCAC CGATTGTCGT TGGTATTAAA CATACTGTTA 3540 

ACGCAATGAG CATCGCAATA GGTAAAATTA AATGCAGGTA AGATGCTATT GGATATAACG 3 600 

TTACAATAAC GACTAAAAAT ATAATTGTTA ACGTTGTTAA TAATGTAAAA AGTGCAATTT 3 6 60 

CATTTGGTGT TTTATTTCTT TCCGCCCCTT CAACTAAGGC AATCATTTTA TCTAAAAAAG 3720 

ATGTACnCGC TTCACTCTCA ACACGTATTT CTAACCAATC AGATGTTACA AGTGTACCGC 37 8 0 

CAATGACTCC ATCAAAATCG CCACCTGATT CTTTTATCA: AGGTGCAGAC TCACCAGTAA 38 4 0 

TTGCAGATTC ATCAACGGTT GCTAATCCAT TTATTACAAC GCCATCAGCA GGGATTGTTT 3 900 

CTCCATTTTC TACCCGAATA TTTTGTCCGG CTTTTAACTC TGTGGCGTTC ACTATCCGAT 3960 

ACGCACCATT TTCTTCTATC AATCGAGCAG TTAAATTTGA TTGTGCTTGT CTTAAACTAT 4 02 0 

CAGCTTGCGC TTTTCCACGA CCTTCAGCAA AGGCTTCTGA AAAATTAGCA AACAATATAG 4 080 

TTATTAATAA TATGATAAAA ATTGTAATCA AATAACCTCG CGATAGATAG CTAGTTCCAA 414 0 

ATATGTCAGG AAAACATATT AATATCAACG TTAAAATCAT TCCAACCTCA ACGACAAACA 4200

TTATCGGATT TTTTATTAAT TGTTTAAGAT TCAGCTTATA AAAACTCATT TTCAAAGCTT 4260

CTATCACCAT TGTTTGATTG AAATATTTAT TTACATGATG CATTTTTTCC ATCCCTTTAC 4 32 0 

TTTATTTTAA AGTTAAAAAT TCACCAATAG GACCAAGTAA TAGTACTGGA ATAAATGTCA 4 380 

AACCACTTAG T AAAAC GAT A AAT AC GAT T A GTGATACGCC AAAATAGGT TTATCAATCG 4 440 

CTATTGTATA TTTATCTTGA TGGTATGATT TTTTATTCAC TAAACTTGAT GCAATCATTA 4 500 

ATTGCAAAAT AATTGGTATA TAACGAGAAA GCAACATAAT GATTCCTGTA GAGATATTCC 4 5 60 

AGAATGTTGT ATCATCTTTC AGTCCTTCAA ACCCTGATCC ATTGTTCGCA GCAGCTGATG 4 620 

TCATTTCATA CATAACTTGT G AAAT AC CAT GAAAAGACGG ATTCGTtATa CTTtCACTTG 4 68 0 

CTCCAGGAAT CATAAAAGCA AGTGCTGAAA ATACTAAAAT TAAAATTGGG TGTATGAGAA 4740 

AG AC T AAG AC AAT AC AT T T C ATTTCACGGG CGCCAATTGG CAT AT T T AAA TATTCTGGTG 4 8 00 

TTTTACCAAC CATCAAACTG CATATAAiCA CCGTCAGTAA G AC AAAT AT C AAT AAAT T C A 4 8 60 

TGAGTCCTAC GCCTTCGCCA CCAAATACAA CATTTAGCAT CATTAATACC ATTGGTCCTA 4 920 

ATCCACCTAT AGGCGTTAAG CTATCATGCA TGTTATTAAC AGAACCCGTT GTAAATGCCG 4 98 0 

TCGTAATAAC TGTAAATAGT GCTGACAAAC CTGCTCCAAA CCGTACCTCT TT&CTTCCA 504 0 

TATTCGGTCC ATAAATGCCT AAATTCGCTA GTATTGGATT ACCACGATAC TCACTCCACA 5100 

TAG T T AAT G T AAG AAT T GC T ATAAAAATGA AAAAC AT T GC GACAAATAAT ATCAACGCAT 5160 

GACGATGTAC TCGTTTACCA TGTCTACTTA ACATGCGACC AAATAAGAAC AACATTGACA 5220 

TAGGAAGTAA CATCATACTG CCCATTTCTA TAAAATTGCT CCAAATATTT GGATTTTCAA 52 80 

AAGGTGTTGC AGAATTTCCT GCTAAAAATC CTCCACCATT CGTACCAAGA TGTTTTATTG 5340 

ATTCAAGTGA TGCAATAGGT CCAAATGCAA TATGTTGAAT ATGTCCGCTT AAAGTCCGAA 54 00 

T CAT T AAAT T AGCATGCAAC GTTTGTGGTA CaCCTGAGT CATCAATAAA ATACTAATTA 54 60 

AACATGATAA TGGTAAAAGT ACTCGGACAA TAAACCGAAC AATATCTTGA T AAAAAT T AC 5520 

CAATGATATT AGTTAATCCA GTTAAACGTC TCAACATCGC TATACAAACG GCGTAACCTG 5580 

AT G C AC TAG A TGTAAACATT AAATATGTCA TTACAATCAT TTGCGTTAAA TATGTCACAT 564 0 

CTGaTTCACC GTTATAGTGT TGtAAATTAC TATTTGTTAA AAAAGATATT GCTGTATTAA 5700 

ACGCTAAATC TATCGATTGG TTTAAATTAT GATTTGGATT TAAAAAAAGC CATTGCTGAA 57 60 

CTATTAGCAA TACAAATGTT ATAAACCCCA TAAATCCATT AAATGCCAGA AAATGTTTGA 5820 

CATATGTTTT AGCTGACATG TGTTCTAAAT CTGTGCCGAT AATTTTAAAA C AC AT AT T T T 58 80 

CAAATCTAGT AAAT AT T AAA TCTACTCTTG ACGATTGCAC CAATGCTACG CGATATAGAT 5 94 0 

ATCCACTAAA AACATACGTA ATCATAACCA TCATTGTTAG AAACAAAATT ATTTCCATGA 6000 

TAACCCTCAC TT AAT AT ATT TCTAAAATTT TTCACTACGA ATAAGGCAT AAAATAAATA 60 60 
CAAAACTAAT GCAATAACTA CCAGTAATAA AACGATGAGC ATTGCCATAA CCnCCTTACA 6120

ACACAACAAC ATCGTAACAA CTTGTTTATG AGAGAAATnT TAATTTTCAA ACTTAGTTAT 6180

TAAGAAAnCA TTAAGATGTG TATGCAGAAA TAAATTTTAT AGCATTTAAT TGTGAAGAAT 6240

ATTATGATAT TGCTATCGAG GTGAAGGTTA TG 6272



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AAATGATGTT T T AC AAT AAA T AT An AAAC G TATCAACATA TATCATCATA TTTTTAGTTT 60 

CAAGTGCAGC CTTTGCAATA TTCTTGTTAA GTGCGnACAT TAGTGCTCAC TCGGAACAAG 12 0 

TGTACGAAAT GACTGACCAT CAAATTAAGA ACAATACGAT AAAT AAAG C A TACGAACATA 180 

AAGACCCTAC AAACAATAGC GAACAAAGAG ATGGGAAAGT GTTCGCTTTA ATAAATTGAT 24 0 

AC AT T G T C AC AACGTTATTT TGCCTATTTT TGCGmAATAG CGTTTTTTAT TACwTTTTTG 300 

CTGATsTTAA ATT TGT TATA TTTTGTTAAA GT AT TAT AAT GATTfiATAA AC AAAT T G AA 360 

GGTAGGTTTT TTAATTGAGT AATTCTGATT TGAATATCGA AAGAATTAAC GAGTTAGCTA 4 20 

AAAAGAAAAA AGAAGTAGGA TTAACTCAAG AAGAAGCAAA GGAGCAAACA GCCTTAAGaA 4 80 

AAGCTTATCT TGAGAGTTTT AGAAAAGGG T TTAAACAACA AATTGaAAAT AC T AAAG T AA 54 0 

TTGATCCAGr AGGTAATGAT GTAACACCTG AAAAAATTAA AGAGATACAA CAAAAAAGAG 600 

ATAATAAAAA TTAAATCACA AATCTGTAAA GAATTTTCTG ACATTATAAC TTGAAATAAG 660 

TATTTTACTT ATCTTTTTAT TTTAAAATAA GTTATAATGT ATTTGATAAA ATTGAAGAAG 720 

GGAAGATACA CAAGATGTTT AATGAAffiLAG ATCAATTAGC TGTTGATACG CTACGTGCAC 78 0 

TAAGTATCGA CACAATCGAA AAAGCGAATT CTGGTCATCC AGGATTACCT ATGGGAGCTG 84 0 

CCCCAATGGC TTACACTTTG TGGACACGTC ATCTGAATTT TAATCCACAA TCTAAAGATT 900 

ACTTCAATAG AGACCGTTTC GTATTATCTG CAGGGCATGG TTCAGCATTA TGTATAGCT 9 60 

TGTTACATGT TTCTGGTAGT T TAG AAT TAG AAG AAT T AAA GCAATTTAGA CAATGGGGTT 102 0 

CTAAAACACC AGGTCATCCT GAATACAGAC ATACAGATGG TGTAGAAGTT ACTACCGGAC 108 0 

CACTTGGACA AGGTTTTGCT ATGTCAGTAG GATTAGCTTT ACAGAAGATC ACCTAGCAGG 114 0 

g AAAT TT AAT AAAGAAGGAT ATAATGTTGT AG AT CAT T AC AC AT AT G TAT TAGCTtCTGA 1200 
CGGTGATTTA ATGGAAGGTA TATCGCATGA AGCAGCTTCA TTTGCTGGAC ATAATAAATT 1260

AAGTAAATTA GTTGTTTTAT ACGAT TC AAA TGATATTTCA TTAGATGGCG AATTAAACAA 132 0 

AGCTTTTTCT GAAAACACAA AAGCTCGTTT TGAffiCATAT GGTTGGAATT ACTTACTAGT 1380 

TAAAGATGGT AAT GAT T TAG AAGAAATTGA TAAAGCGATT ACTACAGCTA AATCTCAAGA 144 0 

AGGACCAACG ATTATTGAAG TTAAAACAAC AAT C GG AT T T GGTTCACCGA ATAAAGCAGG 1500 

AACTAATGGT GTTCATGGGG CACCTTTAGG TGAAGTTGAA AGAAAATTAA CATTCGAAAA 15 60 

TTACGGTTTA GATCCTGAAA AACGTTTTAA TGTTTCAGAA GAGGTATACG AAATTTTCCA 162 0 

AAATACTATG TTAAAACGTG CTAATGAAGA TGAATCTCAA TGGAATTCAT TATTAGAAAA 1680 

ATATGCAGAA ACATATCCTG AAT T AGCAGA AG AAT T T AAA TTAGCGATTA GTGGTAAATT 174 0 

GCCTAAAAAT TATAAG3ATG AATTACCACG TTTTGAACTG GGTCATAATG GTGCATCTCG 18 00 

TGCTGATTCT GGTACTGTTA TTCAAGCAAT CAGTAAAACT GTCCCTTCAT TCTTTGGTGG 18 60 

ATCAGCAGAC CTTGCTGGTT CAAACAAATC CAATGTAAAT GATGCAACTG ATTATAGTTC 192 0 

TGAAACACCT GAAGGtAAAA ATGTGTGGTT TGGTGTACGT fiATTTGCTA TGGGTGCT 1978 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

TAGTAGTATT TATTAAATTA TACGAAGGGA CCcAACACAG AAAATTCATT TTATTGAATT 60 

TTACATTTAT GTGCCAAGTT GGGAAAAATG TCTTATTTTT TCaAAGTATT TAAAAGTAAA 120 

ATTACATGTT AATACGTAGT ATTAATGGCG AGACTCCTGA GGGAGCAGTG CCAGTCGAAG 180 

ACCGAGGCTG AGACGGCACC CTAGGAAAGC GAAGC CAT T C AATACGAAGT ATTGTATAAA 24 0 

TAGAGAACAG CAGTAAGATA TTTTCTAATT GAAAAT T ATC TTACTGCTGT TTTTTAGGGA 300 

TTTATGTCCC AACCTTTTTA GAATATTAAA TTTCTACAAT TTCGTCATCT TCAACAATAA 360 

AGCCCATTGT ATTGACGCTG TTATTTAAGA AAGTCAGAAT AT AACGCAT T ACTTCATCAC 4 20 

GTTCTGGCTC ATTGTGAACC TCGTGGTAAA AACCTTGCCA AGCTTTAAAA TAT AAT T C AG 4 80 

GTGTTTGATA TTTTTCTTTA AACTCATCAA TTGCCCTAGT ATCAACAATT AAATCCTTCG 54 0 

TTCCATACAT TAATAGCGTT GGCATTGGTT GAATGTCATG AATffiGAGCC ATCGTATCTT 600 

TCATCGTCTC ATTAATTGTA T T AT ACCAAT GATACGTTGC TTTTTTTAAC ATTAAACCAT 660 

CGTTAACTGT TTCTTCAACA ATTTCTAAAT TACGTGTTAA ATCTTTTGGT TCTACACCAA 720 
CATTAAAACG TGTGTCTTTT GAAATTTTAC CTATATTTGA AACAAGTTTA TCTTTACGAT 780

TTTTTCCATT CTTTTGAAGT TCTAGCATAG GAGAAATTAA CATCATCCCC TCGATTGGCA 84 0 

ATTCTACTTT TTCAAGTAAA TTTAATAAAA TCAAACCGCC AAGTCCTACC CCTAATACAT 900 

AAGTAGGAAT TTTATATTCA TTAGCTATCT TTAACCAGTC TAGCAAACTT TCGTGATACG 960 

TTTGAAAGTT TTCAATTTGT CCTTTATTAG CTCTTGAAGT TTGACCTTGA CCAGGCAAAT 1020 

CTCCCATAAT CACATGATAG CCATTTCTTC TTAACATCGT AATAACATAT GCATATCTTC 1080 

CCGTATGTTC T AAT AT AT T A TGAGCAATAA CAACGACGCC TTTCGCATCA TTTTCAGCTT 114 0 

CCCACTTCCA CATTATTATA CTGCCCCTTT TTCATTAATC TTCAATAACA SATTATAGC 1200 

AAATTCACTA TGTAGATTTC TAT T TAT AG T ATTATTGTTG TCCATATTAT TAT AT AT AAA 12 60 

TGAAATCAAC ATCAATAATA GTGTAATTAT ACATAATTAT TTTTGATTGT TTTTGATGAA 1320

AACGCTTTCT CGAATATTTT TTTCATGCTA AACTTATTGT AAACACAAGG GTTTGGAGGA 1380 

GTAGCAATGG C AC TAT T AAA GAATTTTTTT ATCGGATTAT CT AAT AAT AG TTTTTTAAAC 14 4 0 

AACGCAGCAA AAAAAGTGGG CCCACGTTTG GGCGCCAATA AAGTCGTTGC CGGAAATACA 1500 

ATTCCAGAGT TAATTAATAC AAT C G AAT AC TTAAATGACA AGAATATCGC TGTTACGGTA 15 60 

G AC AAT T TAG GGGAATTTGT CGGTACAGTT GAJKAAAGTA ATCATGCTAA AGAACAAATT 162 0 

TTAACAATTA TGGACGCGCT TCATCAACAT GGCGTAAAGG CACATATGTC TGTTAAATTG 168 0 

AGTCAGTTAG GTGCAGAATT CGACTTAGAA TTAGCTTACC AAAATTTAAG AGAGATTTTA 1740 

CTTAAAGCAA ATACTTACAA CAATATGCAT ATAAATATTG ATACTGAAAA ATATGCTAG 18 00 

CTGCAACAAA TTGTTCAAGT TTTAGATCGC TTAAAAGGCG AAT T TAG AAA TGTTGGTACT 18 60 

GTAATTCAAG CATATTTATA CGATAGCCAC GAATTAGTTG ATAAGTACCA AG AT T T AC GA 192 0 

TTACGTTTGG TTAAAGGTGC ATATAAAGAA AACGAATCAA TTGCATTTCA ATCTAAGGAA 198 0 

GACGTAGATG CAAATTACAT CAAAATAATT GAACAACGTT TGTTAAACGC ACGCAATTTC 2 04 0 

ACTTCAATTG C AAC AC AT G A CCATCGCATC ATTAATCATG T AAAACAAT T TATGAAAGAA 2100 

AAT C AC AT T G AAAAAG AT C G TATGGAATTC CAAATGCTCT ATGGTTTTAG ATCAGAGTTA 2160 

GCAGAAGAAA TCGCAAATGA AGGCTATAAT TTCACTATTT ATGTACCTTA TGGCGATGAT 2220 

TGGTTTGCGT ATTTTATGAG AAGATTAGCA GAACGCCCAC AAAACCTATC TCTTGCTGTA 22 8 0 

AAAGAATTTG TGAAACCTGC TGGCTTAAAA CGTGTTGGCA TAATTGCAGC TTTAGGAGCT 234 0 

ACAGTTATGT TAGGTTTAAG TACAATTAAA AAATTATGCC GTAAATAGAG CAAGACATAA 2400

ACAATAATTT AGGAGTCTGG AACAATAATC AATGTTCTAG GCTCCTAAAT GTTATATTGG 2 4 60 

CAGTTCTGAA ACATTTATAA ATCTTTGATT ATGAACTCAC AACAGTGCTC TATGCTTTTA 2 52 0 
TAGATTTTAA TAAATTAGCC ATTTCAATTG CACTTACTGC TGCTTCAGCA CCTTTATTGC 2580

CAGCTTTCGT ACCTGCTCTT TCCACAGCTT GTTCAATAcT TTCAGTCGTT AAAATACCAA 2 64 0 

ATATGACTGG TACATTAGTT TGATCATTCA CTTTAGAAAC ACCTTTCGCG ACTTCATTAC 27 00 

AAACATAATC AT AAT GAG AC GTAGCACCGC GAATTACGCA TCCTAATGTA ATTACTGCAT 27 60 

CATAATTTCC TGATGAGGCT AATTTTTTAG CTACTAAAGG AATTTCAAC GCACCTGGCA 2820 

CAAATGCTAC ATCAATATTG TCTTCATTAA CATCATGTCG AATCAAAGTA TCTTTTGCAC 28 8 0 

CTTCAAGTAA TCTTCCAGTG AT AAAAT CAT TAAATCGACT AACTACGATT GCAACTTTCA 2 94 0 

AATCTTTTCC AATTAATTTA CCTTCAAAAT TCATGTTAAA ATCCTCCTAT AT TAAAT G AC 3000 

CCATTTTTAT TTTTTTCGTT TCCATATAAT CAT GAT TAT G TACCGTTTCT GGTACGATAA 30 60 

CTTCAATTCT TTCTGCAATA TCAATGCCAT ATTGTTTTAA TCCCTCAAAT TTACTTGGAT 3120 

TATTACTTAA TAAATTGATA TGTTCGATGT TAAAATATTT TAAAATCTGT GCAGC AAT AT 318 0 

GATAATCTCG CAAATCTTCA TCAAAACCTA ATGCTAAATT TGCAGTTACT GTATCATATC 32 4 0 

CTTGCTCAAT TAATTCATAT GCGCGTAATT TGTTTAACAA TCCTATGCCA CGACCTTCTT 3300 

GAGGTAGATA AATAATCATG CCACCATGTT CATTGATATA C T T CAT AG AC GATTCAAGTT 3360 

GAGCACCACA ATCACAACGT TGACTATGGA AAATATCGCC TGTAAGgCAC GCAGATGTA 34 2 0 

AGCGTACATT TTCATGTTGT CGAATTGCAC CTTTTGTCAG TACAACTATC TCTTCATCTG 34 8 0 

TGTATGTCGC TTTAAAACCA TACATATCAA ATGTTCCGAA ATCTGTAGGC ATTTTCACTT 354 0 

TTGCCTTAAA TTCAATTTCT GGTTCTAATT TTTTACGATA TTCAATTAAA TCATCAATCG 3600

TAATCATCTT TAATTGATGT TTTTCTTTAA ACTTTTGTAA ATCTTGTCCT TTCGCCATCG 3660 

TGCCGTCATC ATTCATAATC TCACAAATGA CACCAGCGGG CTTGGCACCA GTAAGTTTAG 3720 

CTAAATCAAC AGCCGCTTCT GTGTGTCCAT TTCTAGCTAA TACGCCTTTA TCTTGTGCTA 3780 

CTAATGGAAA TAAATGACCA GGACGATTAA AATCTTAGC TTCACTACTA GGATCAATGA 38 4 0 

GCTTTTTGGC AGTCAATGTA CGTTCATAAG CACTAATTCC TGTTGTTGTA T C T AC AT GAT 3900 

CAATACTCAC TGTAAATTGC GTACCAAAGA TGTCGGAGTT ATCATCAACC ATTTGTACCA 39 60 

AATCCAAACG TTGTGCAATA TCTTTAGACA CTGGTGCGCA TATTAATCCC CtTGCTTCTT 4 020 

TCGCCATAAA ATTAATGGTA TTATCGTTCA TCCATTCAGT AACCGCTACT AAATCACCTT 4 08 0 

CATTTTCACG ATTCTCATCA TCTACTACAA TAATTGGTTC TCCATTTTTT AAAGCCATTA 414 0 

AAGCACTGTC AATATTATCG AATTGCATGC TACCCCTCCt AAAAACCAAA TGCTCTTAAT 4 200 

TTATCTACAG ATAATTGGTC TTTATCTTTA T T T AAAAT AT TTTCAACATA TTTAAACAAA 4 2 60 

ACGTCTGTTT CTAAATGTAC TTTATCTCCT AATTTTTTGG ATGAT AAAAT CGTTGAACGC 4 320 

CTCGTTTCTG GAATAAGATG AATGTCAAAA CTGTTATCAT GCTTATCAAA TACCGTTAGA 4 38 0 
CTAACACCAT CCACAGTAAT AGACCCTTGC TTAACTAACT GATSTTAAT ATGTTGGCTA 4440

CATTGAATCG TAATAATTTT TGCATTGGCT GTTTCATTTA TTTTTGAAAC TGTTCCTAGT 4 500 

T CAT C T AC AT GACCGAGGAC AAAATGTCCA CCAAACCTAC CGTTACCACT CATGGC AC GC 4 560 

TCTAAATTTA CTTCTGATTG TCGCTTAACA TCTGCTAAAT AGGTTTTATT TTCAGTGCCT 4 62 0 

TTAATTACTT GAAC AG T AAA AGATGTCTGA TTAAAATCAA TCACTGTTAA ACATGCACCA 4 680 

TTAACACTGA TGGAATCACC AATATGCATA TCTGCCGTAA TCTTATGTGC TTCAATTTCA 4 74 0 

ATCGTCCTGA CTGATTGACG AATTTGAACA CTTTTAACGA CACCTATTTC TTCAACGATG 4 800 

CCAGTAAACA TGCATCATCA CTTCTTfCGT AAAGTTAATT TAACATTTTG ATTTAATAAC 4 8 60 

TCGGAATGAA CAATTTCAAA TTGGTTCGCA TCTGGTATCT CAATCACATC ATTTGTTTGA 4 920 

TAAAATTGAT AATTTCCAGA TCCGCCAATT AATTTCGGGG CATAATAGAG AATAAATTCA 4 980 

TCTATATAAT TAGATTGGAG AAATTCTGAA GTAGTGGTTG GACCTGCCTC BCTAGCAAA 504 0 

GTTCCAACTC CTCTTTTATA TAAATTGTGA AGAATTGTTG TTAAATCGCA AGACTTCAAG 5100 

TAAATAATTT CAATATGTGT TTGATTGGTT GTTAAATTTG GATTTTCAGT ATATATCCAA 5160 

ATTGGTGTTG ATTCATCTTG ATAAATTTGC TGATTAAAAT GAATATTCCC AGAC T TAG AC 5220 

AATATTACTT TTATAGGGTT TTTTCCATCT TGAATACGTG TAGTATATTG TGGATCATCT 52 8 0 

AATTCAACTG TACGTCTTCC AGTTAACACT GCGTCGTGTC GATGTCTTAA CTTATAGACA 534 0 

TCTTGTTTAA CCTCTTTGTT AGTAATCCAT TGACTTTGTC CATTATCATT CGCTTGTTTA 54 00 

CCATCTAAAC TTGCAGATAC TTTCACTGTA ATTlGTGGCA GTTGCTTTGC TTTTGCTTTA 54 60 

AAAAAGTCTT GGTATAATTG TGATGCCCGT TCATCATCAA CGCATTCAAC CTCAATACCG 5520 

TGAGCCCGTA ACGTCTCATC ACCATGTGTG TCTAACGAAT TGTCTTTTGT TGCGTATACT 5580 

ACTTTTGCTA TCTTACAATC AATTATTTTG T T AA C AC AG G GTGGTGTTGA ACCAAAATffi 5 64 0 

CTACATGGCT CTAACGTAAT ATAAATCGTC GCACCTTCAG CATTTTGTTG TGCCATATCA 57 00 

AGTGCTTGAA CCTCCGCATG CTTGTCACCT TTTCTCAAGT GTGCACCAAT ACCAACAATC 57 60 

CTACCTTCTT TAACTACAAC AGCGCCAACG GGTGGATTAA CACCTGTTTG ACCTTGTACC 5820 

ATATTTGCAA GTTGAATCGC ATAATCCATA AATTGACTCA AATGATCACC TCTATAAACA 58 80 

AAAATCCTCA CATCATGAAT TAAGATGCAA GGAG a AAAAT TTATCGTTAA ATAAGCCTAT 5 94 0 

TTGTACACAT TTTTACAAAT ACGCTACATT ATCTTTGTCG ATAATTAACA TTCTTTCTCC 6000 

CATCCAGACT TTAACTGTCG GCTCTAGAAT CTCACTAGAT CAGCCACTAA TATGAAACAT 60 60 

ATTAGCAGGT CGCAGGCTTT ATTTACTGCC GGTTGGGAAT TTCACCCTGC CCCGAAAgAA 6120 

TTaTATATGA AATTGTTATA GATTATTTGA GTACGTAGTA TGTCAACTAC ATTTAAAATG 618 0 
ATACTATATG TTTTCTGAAA AAACAATTAA TGACGGTTTT AATTTAATAT AATCTGAGTA 6240

CTATAGGCAT CTCATTGATA TGATTCTTAC TAACAGACAT TAAAATCAAA CCTTCAATTC 630 0 

GTCTCTATAG AGCGTTCTCT TTATTATCTT CTAGTTACAA ATTATTGATT GtCACtGCGC 6360 

TGTTGTTGCT CATTCGATTC TAAAGCATCA TATAATTGAG AT AC T G TAT G CGCAACTTGT 64 2 0 

TCTACAATCA TTTTCACACC GTTTCGTAGT TTATTAACAC CGTTTGTCAT TTGACCTATC 64 8 0 

GCAATCATAT TTGTTAATGT TCCAAACCTT GGACTAATAA CTTGATTGGT TTCCGGAATG 654 0 

ATTTGTATGC CTCCCATTGG GTGTGCTTGT ACAATTTGTC TATTTTCAAG ATTTCTAATT 6600 

AATTGATCAT CTTGATCCAA TTCATTTAAA TGACTTTTTG CACCTGTGC GTTAATGACA 66 60 

ACATTATATA TGTCTACTGA TTCTTGGTTT TTGTATGAAA AATAATACAA CTTGCCATaC 6720 

ATGTTCACAT CTTCTAAATC TTTTTTCAAA ATTAAAGACT TATTTTCTAT TAATTCAATA 67 8 0 

AT TAG T T C AG CAGTTCTTGG AGGCATTGGA TTTGAATTTA ATTGAATCAT CTTTGAGTAT 68 4 0 

TTTTGATTAA ATTGATGTTG GTCTTCAATA CTTAAGCTAT TCCATATCCA ATTTAAATTC 6900 

TCTTTCAAAT GTTCAATCAT ACTTTGGAAA ATGCCCaTTT CTGTTGGACG CGCTAAATCA 6960 

TACTTCAAAT CTGCAATATG ATTTCCTGTA CGTCTATGTA CTAATTTTTT AAAATCAATG 7 02 0 

TCATATTCAG CACATTCTTT TAAAAATAAA GAAACTAAAG TATCAAGCGG TGCATTGCCG 7 08 0 

AAATGATGTT TTTTAATGTC ATTTAATTTG TCTTTAGTTA AGTACTTGAA TGTCACGTCT 714 0 

ATCATTGTAC CTCTTACACT TGGTAAATGA GCAGAACGAC TCGTCATAGT AATTGGTAAT 7 200 

TTTGGATGAT GAGCAGCAAC ATAACGGACA ACATCTAAAC TGGCAAGGCC TGTACAATA 72 60 

ATCGCAATAT CGTCCAGTTC ATTTACTTCG TCTAACGTAT TATATGTTGG ATAAGGCGTA 7 32 0 

gcGATATATC CTTTTTTACC CTTTAAGTTA TATGGATCAT GGTAGGCAAA TGTACCACAT 7 38 0 

GTTAAAAATA CATAATCGTA CGCTTGCCAT GATTGTCCTG AATTTGTAGT AC AT AT G T AA 7440 

TAAGTTAAAT TCGTTTCATC GAT AT T AG AA TTTGTATAAA TCTCTTGAAC TTTATTATAA 7500 

TTAGTTGATA TATTTGGATA TTTTTTCGTG AACATAGATA AATAAGATTT CATATAATGT 7 5 60 

CCGAATACAA ATCTCGGTAA ATATGCAG 7 58 8 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

nCTAGGTATT TTAAACCTAA TCTAGATAAA CTAGCTTCGT AAGCAGCTGC T AC AT T T T C A 60 
CGACCGAAAT CCTCAAAATA TAATTTTGAA GTAATAAATA AGTCTTCTCT AGCAATACCA 120

GTTGACTCCA ATCCGGCACG AATGCCAGCA CCTACTTGTT CTTCATTCCC ATAAACTTTT 180 

GCGGTATCAA TACTACGATA TCCTTGTTCA ATGGCATACT TAACACTTTC CATGCAATT 24 0 

TCATCATTTT CCACACGAAA TGTCCCTAAA CCAATTTGTG GCATCGTGTT TCCATTATAA 300 

AATGTTTTAA CCTCCATAAA TATCGCCTCA CCTTTTTGAT GT AT TAT AC C CTGTTATCAT 3 60 

AACAAATCTG AGTTGAATAC AT G AG AAAAA ACACTTAGAG CAATCAACCA CTAAAATTCT 4 20 

AGTAATATCT CTCAAATATT AATCAAATTG TAAAAGTAAT TCTGTTTAAT T TAT G AC AAA 4 80 

CTAAAAAAGC CGAAGTAACA ACATATAGTC ATCACTTCAG CCTAACATTT AATTGAATGA 54 0 

TTCAATTTTA TCCATCATTT GTTGTAAGTC TTCCACGTTG TATTGAATAC GACCATGGAA 600 

TACAAATTTG TTAAAGAACT CGTCTAATTG TTCAGCACG AC AAGCAC T T TGACAGCACT 6 60 

ATTTTGATTA TAATTTGAAA TCGTTACATC GCCTTCATTT TTAAGATTAA AGTATAAAAT 7 20 

TGAAGTTGGT GTATATTTGG CACCTAATTC TTTTTGTAAG TCTTCAGCCA ATTGTTTAAT 7 80 

CGCCTCAATT TGATCTGAAT AAT T T AC AAA TGATAATGAA CGTTTGTCAT CATTTTGATC 84 0 

CATCACAATA GTTTGCGGTC TAGATTTATC TAAATCCAAT GTATCAAATA CTTGTTCCAT 900 

TGGTGGTAAA TCTTTAAATT GACCGCCACT AATACC AT T A TAAACATGAC CTTTTAACAA 9 60 

T T GAG AAT C A ATAATATAAA GACCAGTTCT TGTTAATACT AAATGACTAA TTCGTTCAAT 102 0 

ATTATTAAAG CCATCCTTTG GTAAAAAGAT ATTTGCCATA ATGTGCATAT CTTCTGGTCG 1080 

AATTCGTTTT TCTTTAACTA ATCTTTCACG AATACCAATT AATCTCATGT CCGTTACATA 114 0 

TTCACTATGA TTTTTCGAGA AC AAT T T T AA TGCGTCAATC TCACGATCTT TTGTACTAAC 12 00 

CATGTGATTA TAATCTTCTT GTTGTTTTGT AATTGTCTTT TTATTTGAA TACGCTCTTT 1260 

CTCTAAAGCT TCTTCATGAG ACTTTTTAAT GTTTTGTTCT TGTTGTTCAT ACTTTTCTTC 1320 

TGTTTGTCGC TTAACTTTTT TCTTACTACC TAAGGCAACT AAAAAAAGGA CAAAAAAGAT 1380 

TAATGCAATG AgCTACTGCA ATAATGAGTC CAATGACTAT CGGTGAAGAT AAATCCATCA 14 4 0 

CAACAACGCT CCTTTTTAAT ATATGAATAA CTTTAATTAT AATAGAaAAG CTAAAGATTT 1500 

TCGATACATA TTATCATTTA TATACCGAAA ATCTTTTATT TAGCTATATT CAATTCATCT 1560 

TATTATTTTA CTGCGTCTTT TAATTCTTCC ACTTTGTCTA ATTTTTCCCA TGGGAATAAG 1620 

ACATCTGTAC GTCCAAAATG ACCATAACEA GCAGTTTGTT TGTAAATCGG TTGTTTCAAA 1680 

TCAAGCATTT T AAT AATACC TGCAGGTCTT AGGTCAAAGT GTTTTCTAAC TGCTTCGACA 17 4 0 

AGTTGCCCTT CAGAAACTTT ACCTGTTCCA AATGTATCAA TTGCAATTGA CACTGGTTCT 1800 

GCAACACCAA TCGCATATGC CAATTGTACT TCACATTGAT CTGCTAAACC TCTECAACA 18 60 
ATATTTTTAG CCACATAACG TGCAGCGTAT GCAGCTGAAC GGTCTACTTT TGTAGGATCC 1920

TTACCACTGA AGCATCCGCC AC CAT G AC G T GCATAGCCAC CGTACGTATC AACAATGATT 1980 

TTACGTCCTG TTAATCCTGC ATCACCTTGA GGTCCACCGA TTACAAAGCG TCCTGTAGGA 2040 

TTGATGTAGA ATTTAGTTTG TTCATTAATC AAGTTTTCTG GAACAGTTGG ATAAATGACA 2100 

TGTGCTTTAA TGTCTTCTTG AATTTGTTCA AGTGTCACAT CCTCAGCATG TTGTGTTGAT 2160 

ACGACAATCG TATCAATACG TACTGGGTTA TCATTTTCAT CATATTCAAC AGTGACCTGA 2220 

ACTTTACCGT CTGGTCGTAA ATAATTTAAC GTACCATCTT TACGCACATC TGATAAACGT 22 8 0 

TTTGCCAATT GATGTGATAA ATAAATTGCT AGAGGCATAT ACGTCTCTGT TTCATTCGTT 234 0 

GCGTAACCAA ACATTAAACC TTGGTCACCT GCACCTGTTG CTTCAATTTC TTCTTCGCTA 24 00 

TCTTTATCAC GATACTCTAA TGCTTTATCC ACGCCTTGTG CAATGTCAGG TGATTGTTCA 2 4 60 

TCAATCGCAG TTAAAATTGC CATTGTTTCA TAATCATAAC CATATTTTGC TCTTGTGTAT 2520 

CCAATTTCTT TAATTGTTTC TCTAACAACT TTCGGAATAT CAACATATGT TGTTGTAGAA 2580 

ATTTCGCCGG CGATCAATGC CATACCTGTT GTAACAGTTG TTtCACAAGC TACACGTGCA 2 64 0 

TTTGGATCGT CTTTTAAAAT AGCATCTAAT ATTGCATCTG ACACTTGGTC AGCGATTTTA 2700 

TCTGGGTGTC CTTCTGTAAC AG AC T C T G AA GTAAATAATC GTTTGTTATT TAACATAGTT 27 60 

TGCTCCTTTA AATTTATATT ACGAAAATTC TCTCTCTGTG AGCTAAATAA AAAAGACCTT 2820 

CTAACTATTA ATATAGAGAG AAGGCCTAAT ACGTCCATTC GCTCTTATCG TTCAGACCTA 28 8 0 

TTTGTCTGCA AAcGGTTTGG CACCTTTCTT TTATAAAAAA GAGGTTGCTG GGTTTCATTG 2 94 0 

GGTCCATGTC CCTCCACCAC TCAGGATAAG AGAATCCGTT AAAAATAATA GTACCTAATT 30 00 

AATGAATTAA TGTCAATTTT TCACAAATAA ATTTACAGTA AAATATTGTA GATTAATTAT 30 60 

GTTAATGTGT TATACTAATT AAATGTAAAG GCTTACATTT AAATTATCGC TTTGGAGGGA 3120 

TTTAGGATGT CAGTAGACAC AT AC AC T G AA ACAACTAAAA TTGACAAATT ACTGAAAAAA 3180 

CCAACGTCAC ATTTTCAACT TTCGACGACA CAACTTTATA ATAAAATCTT AGACAATAAC 32 4 0 

GAAGGGGTAT TAACAGAACT TGGTGCTGTT AATGCAAGTA CTGGAAAATA TACTGGTCGT 3300 

TCGCCTAAAG ACAAATTTTT TGTCTCTGAA C C T T CAT AT A GAGATAACAT TGATTGGGGA 33 60 

GAAATTAATC AACCTATCGA TGAAGAAACT TTCTTGAAGT TATACCATAA AG T AC TAG AC 34 20 

TATTTAGATA AAAAAG AT G A ACTATACGTA TTTAAAgGcT ACGCTGGTAGCGATAAAGAT 34 8 0 

ACAATGTTAA AACTTACAGT CATCAATGAA TTAGCATGGC ATAATTTATT TGCTAAAAAT 3540 

ATGTTTATTA GACCTGAATC AAAAGAAGAA GCTACAAAGA TTAAACCTAA CTTCACTATC 3 600 

GTTTCTGCAC CACATTTTAA AGCAGATCCA GAAGTTGATG GTACTAAATC TGAAACCTTT 3 6 60 

GTCATTATTT CATTTAAACA CAAAGTCATT TTAATCGGCG GTACTGAATA CGCTGGTGAA 37 20 
ATGAAAAAAG GTATCTTCTC TGTAATGAAT TATCTCTTAC CGATGCAAGA TATTATGAGC 3780

ATGCATTGCT CAGCAAACGT TGGTGAAAAA GGCGATGTTG CATTATTCTT TGGTCTATCT 384 0 

GGCACTGGTA AAACAACCTT ATCGGCTGAC CACACCGTA AACTAATCGG TGATGATGAA 3900 

CACGGCTGGA ATAAAAACGG GGTCTTTAAT ATCGAAGGTG GCTGCTATGC AAAAGCAATT 3 960 

AATCTTTCCA AAGAAAAAGA ACCACAGATT TTTGACGCAA TCAAATATGG TGCAATTTTA 4 02 0 

G AG AAC AC T G TAGTTGCAGA AGATGGTTCA GTGGACTTTG AAGACAATCG TTATAGSAA 4 08 0 

AACACGCGTG CCGCTTATCC AATTAATCAC ATTGACAATA TTGTAGTACC ATCTAAAGCA 414 0 

GCACATCCAA ATACAATTAT TTTCTTAACT GCGGATGCAT TTGGTGTTAT TCCACCGATT 4 200 

TCAAAGTTAA AT AAAG AC C A AGCAATGTAT CATTTCTTGA GTGGTTTCAC TTCTAAATTA 4 260 

GCTGGTACAa GCGTGGTGTG ACAGAACCTG AACCATCATT C T C AAC AT G T TTCGGAGCAC 4 32 0 

CGTTCTTCCC GTTACACCCT ACTGTTTACG CTGATCTATT AGGTGAACTT ATCGATTTAC 4 38 0 

ATGATGTTGA TGTTTATCTT GTTAATACTG GATGGACTGG CGGAAAATAT GGTGTAGGAC 4 44 0 

GTAGAATCAG CTTACATTAC ACACGTCAAA TGGTAAAC®. AGCGATTTCT GGCAAATTGA 4 5 00 

AAAATGCAGA ATATACAAAA GATAGTACGT TTGGTTTAAG CATTCCTGTA GAAATTGAAG 4 560 

ATGTACCGAA AACAATTTTA AATCCAATTA ATGCTTGGAG C G AC AAAGAG AAATATAAAG 4 62 0 

CACAAGCAGA AGATTTAATT CAACGTTTTG AAAAGAACTT CGAAAAATTT GGTGAAAAAG 4 68 0 

TTGAACATAT TGCTGAAAAA GGTAGCTTCA ACAAATAAAT TTGAATACTA AATCaAAACC 4 74 0 

ACCCGTGTGA ACGGGTGGTT TGTTCTGCGG CTATAAGCCT TCCTTACTGG CCAGCCCTAA 4 8 00 

AAGGGCACTG ACAAGTCAGC CAACTGCACT ACTATTCCAG CAACCCTAAA GGGTTACTCT 4 8 60 

TTTTTCTTTC TTTTTTTATT TTTCTCTCCA GTGAAAGGAT CTAAATATTC TTCCATTGAG 4 920 

ATTTGGTCTG CAACGATATC CTCTTGTAAT TGATTACGAA TATAATTTTC AATCACTTTT 4 98 0 

TTATTTCTAC CTACTGTATC CACATAAAAT CCTTTACACC AAAACTTTCT ATTTCCATAT 504 0 

CTATACTTTA AGTTAGCATG TCTATCAAAT ATCATTAAAC TACTTTTCC TTTTAAATAG 5100 

CCAACAAATG ATGATACCCC AAGTTTGGGT GGTATACTAA C T AAC AT AT G GATATGATCT 5160 

TTACATGCCT CTGCTTCAAT TATCTCTACA CCTTTTCTTT CACATAATTG ACGCAATATA 522 0 

ATCCCTATAT CTTTTTTTAT TTTTCCATAT ATCACTTGTC TTCTGTATTT AGGTGCAAAG 52 8 0 

ACAATATGAT ACTTACAATT CCATTTTGTA TGTGCTAAAC TGTTTGTGTC AGATGACATT 534 0 

AAATAGCATC TCCTCGTGTT GAT TATTTTG GTTGGCTGAC CAATATTTAT TCTAGCACGT 54 00 

AGAGATGCAT TTTTTGTGAC AATGGTAGAA CCTTTTCtGa ACCATACGCA TAGCGTATGG 54 60 

TTTTCTTTTT AC AAT T AAAG AGCCAACCET TGTTATAGTC TAACAATGGT TGGCTCCTCT 5520 
TATTTTATGT GCTAAAAATT TATAGGCAAT TTTATTACAA CAATGTACAT TTAAGGTGAC 5580

CTTCATGCCA AAATCGCATC ACTCATTTAA TGGAAGCAGC ACGTCTTCAT ATAAAGTACC 5 64 0 

GATCCCTAAT TCAACGCATG TAGTACCACA TCTTCAAAGC TTGATAGTTC CCAGCGCAC 57 00 

ACCACGTTTC ATACTAGCTA TGCGACTCAA CTTGGTTCAT AAACTCTTTA ATATAAGTCA 57 60 

ATGTTTCAAC CATCGCTGGT GGTCTTGGCA CATGTCCTTC TGCCATTTGA TAAAATGTTT 5820 

CATGCGTGGC ACCTTTTAAC TCTAGTTGGT CCGCTAAATA ATACGCATGA TGAATACCAA 58 8 0 

CTTGCTGGTC TTTCCCTCCA TGTACAATTA ATATTGGCGG ACTGTTTTCA TTAATGTTTG 5940 

GAATCGCTTG GCGTGCCTCA TATGCCGCTC GATCTTTTTT CGGATGACCA ATCATTCTTC 6000 

GTAGCATGCC TCTTAAATCG ACACGTTCTT CAT AC AT T AA ATCAATATCT GAGACACCAC 60 60 

CCCAGATTGT ATAACTTGTT ACTGGTAAGT CTTG&ATGT CAACAATCCT TGTAAACCAC 612 0 

CTCGCGAAAA ACCAACCATG TGGATAAATG CATGTGGATA TTTATCATGT AGCAACCTTA 618 0 

ATAATTGCGT CACATCATTT AAATCGCCAC GGTAAAATTC GTCTTTGCCT TCACTCCCAT 624 0 

TGTTACCTCG GTAGTATGGC CCAATCACTA AAGTTTGACT ATCTGAAAAT TGCATTAATC 6300 

TACCTGCGCG CACACGTCCT ACTTGACCTT TGCCACCTCG CAAATAAACT ACAATGCGAT 6360 

TTACTTCATG ATGTGGTGTC AT CAT T AAAG CTTTTACTTG TAAGTCATCT GACAAATATG 6420 

TAATTTCTTC GAATTGATGC GTAAAATATT CAATTGGCAT TCGTTTACGT TTGATAAAAC 64 80 

CCAAGTGATT GCACCCTCTC TACGCATTTT AAAATGGTAC TATCTTGCAG TAAGAAACTC 65 4 0 

CGTTGTGCGA GTTCAATATC ATTGATACAG TTAAACAACA CTGGCCCTGC TGTTTCTAAA 6600 

TAATCGTTCT TGCTTACCAA TGATTCAACT TCGATAAAAT ATACATCTTT TACAAAATCA 6660 

GTTTGATCAT GTGTTTCAAT GGTATATTGT GCTATGTAAT AATATTTTT AACTTTGGCG 6720 

CCTGTTTCTT CATATAATTC aCGTGTAACT GCTTCAGCAC TACTTTCCCC GCGTTCCCTT 67 8 0 

TTACCACCAG GAAATTCAAT CCCCCGTAAA TTATGTTTGG TAAAAAGCAA TTGATTTTTA 68 4 0 

AACGTTGGAA TAGCTAGCAC ATGATTGCCA TCTGCTATCT CATTATCCTT TTTAAATGTC 6900 

AAATTAACTT GACGATTATC TTTATCCCTA AACTTCACGC GCATCACATC CCTACATTGT 6960 

ATGTTAATAT AATAGTTAAT TACTATCGTT GGAGGCATTA ATTATGAAAA AGATATTCTT 7 020 

GGCGATGATT CATTTTTATC AACGTTTCAT TTCGCCACTC ACTCCACCAA CTTGTCGTTT 7080 

TTATCCAACA TGTTCAGAGT ACACTAGAGA AGCGATTCAA TACCACGGTG CTTTCAAAGG 7140 

CCTTTATTTA GGTATCCGTC GTATTTTAAA ATGTCATCCG CTTCATAAAG GCGGCTTTGA 7200

CCCTGTTCCG TTAAAAAAAG ACAAGTCAGC AAGCAAGCAT T C AC AT AAA C ATAACCATTA 72 60 

ATATGGTTGT AATTGAGTTA TATCCACTAA AGGGGGGCGA AATTCGAGTCGCCCCTCTTT 7 32 0 

TAATATGCCT GAATGCGCCA CCACATCTTG TTCAAAATAA TAACCTGCTG GTGTAACATC 7 38 0 
TCCTGGATAA TCACCTTTAC GAGCAAGCAT CGCTGTAAAA TAGCGGCTTA AACCATATTC 7440

GTACATGCCG CCAATAACCA CTTTTGCACC ATGACTTTTC AAAGTATCAA TTGCCGTTTG 7 500 

CACTTTATCA ATGCCACCTA GACGAAATGG TTTTAATACA ACAACTTTCA CATTGTATAA 7 5 60 

TTCTATCAAA TTAATTATGT CCaACAACGA TGTTGCCTTT T C AT C AAG G G CTATTGGAGG 7 62 0 

TATTGTTCCA TCCGCTACTT CATCAAGCAT GGAGATATCT TTAAATGGCT CTTCGATATA 7 68 0 

AAGAACCTGT TCACGCGCTA ATAACTGTAA CTGTGTGAAA TCTTGACGAT CCAAGGACTC 7740 

ATTTGCATCT ATAACCAATT G AAAG T G AAA GTCTAATTCC CGTAACACTC TAATTTGATG 78 00 

CATGATTTGA GGCGTCCATT TTAATTTAAT TCTGGTCGGC TTTGTTGCTT TTAATGACTC 7 8 60 

TAGTTGTTTA TTTGATAAGC CGCTCGcTGT CGCTCCATAT GCTACTGAAA ATGAAGGAG 7 92 0 

TACATGAAAC ATTTGATACA ATGCCATGAC AATAGTTGCC CTTGCAGCAG GCGTATTTTC 7 98 0 

CAATGAATCT ACTAATTTTA GTGCTGCTTC ATACGTTTCA AATGATTTAT TTCTATTATC 8 04 0 

TTCGAACCAT TGCTCAATTA CATGTTTCAC TGAGGCAATT GTTTCATGAT CATACCAATC 8100 

TGTTTGAAAA GCGTTACATT CCCCGAAATA TGCATTTCCT TTGTCATCAA TCAATTCGAT 8160 

AAACAAACAA TCACGATGCG T T AAAG T G AC TTTCGGTGTT ACAATTTGTG ACTTAAATGG 8220 

CTCACTATAT T T AT AAAAAT GCAAAGCTGT CAACTTCATC AAATCATCCT CTATACAACT 8280 

TATTTCTTTG TAATTTACCT GTTGATGTAT AAGGTAAACT ATCAACCTTT TCAAAGTGTT 8340 

TCGGTACTTT ATATTTCGCT AAATGTTGTG ATAAATATGC AATCAATTGT GCCTTTGAAA 8 4 00 

TGTCACTTTC ACT G AC AAAA TATAATTTAG GCACTTGGCC C CAAGT AT C A TCAGGATGCC 8 4 60 

CTACACATAC TGCGTCACTG ATACCTGGAA ATTGctTCGC TACCGTTTCA ATTTGATATG 8520 

GATAAATATT TTCACCGCCA CTAATAATTA AATCTTTACG TCGGTCATAA ATCATGACAT 8580 

AACCTTCATG ATCTATTTCA GCAATGTCAC CCGTATTAAA ATAACCATTT TCAAACGTAC 8 64 0 

CCGTTAAATC T G T T GG AT AC AAATATACAT TCATCACATT GGCGCCTTTA ATCATTAATT 8 7 00 

CTCCATGACC TTCTTTATTA GGATTTTTAA TTTTTACGTC AACATTGGCA CTTGGCATCC 8 7 60 

CTACAGTGTC AGGACGTGCA TGCAACATTT CCGGTGTTGC TGTT AAAAAT TGCGAACATG 882 0 

TCTCAGTCAT ACCAAATGAA TTATAAATTG GCAGGTTATA TTGTAATGCC GTCTCTATCA 888 0 

TAGTGGCAGA TAATTTAGCA CCGCCGAGTA ATATTTTTTG TAAATXTAA GGTTCATGTA 8 94 0 

AACCTTGTTG CATAAGCCAA TTTAAAGTTT GTGGCACAAG CGAAATGTGC GTGATTCGTT 9000 

CATTTTTAAT CATCGTTAAA ATTTGTTCGG CATTGAATTT ATCAACAATG CGCACAGTAA 9060 

AACCTTCAAT AACAGCTCTT AAAAGTACAC TGAGACCCGA AATATGATAA ATCGGCAAGA 912 0 

CAGATAGCCA ATTAGTGTCA CGATCAAATC CCAAGCTCTC TTTACATCCG ATTGCACTGG 918 0 
CATAATGATT ACGAAACGTT TGTGGCACCG CTTTTTGAGG GCCCGTTGTC CCTGATGTAA 9240

ACATAATCGA TGCAATGTCA TCTAAATTAA ATGATGTATT TAATATGTTG GACGGCGACT 9300 

CTTTCGGCAC CACAGTTTCA TTCGATGTTT CATATTGGAT ACCCATTGTG TTGTCCAACA 9360 

AACTGTTCGT TGTAATATCC CTTCCAGCGA ATTCAATATC ATCCAGCGAT ACAATTTGAA 9420 

ACCCTCGTAA TTCCAGTGGC AAGGTACAAA AAATCAATTG TACATCGATT GACTTCATCT 94 8 0 

GATTCGTCAT CTCATTAGGT GTCAACCTTG TATTAATCAT CGCAATTTCA AT"ATTGCCA 9540 

ACCAACATGC ATGTATTAAA ATGATCGATT GAATCGAATT ATCTATGTAT AGCCCAACAC 9 60 0 

GAGATTGTTG ATAAGCCTTG AGTCTTTTAG CCAATAGACT CGCTTCACAG TATAAATTTT 9660 

GATAAGTATA AGATTCTTGA CCGTCTGTTA TCGCAATATG ATGTCCATTT TGTTGTGCTT 9720 

GTTTATATAA CCAAAAGTCC ATGCGTTATT CCTCCAAAAT CATTTACATT ATAATTATAA 97 8 0 

CGATTTTATG ACATTCTAGC AGTGGTTATG TTTAAAAATA TAAAAAAGTA GACGAAT TGA 98 4 0 

TGCATTGATA TGATTGTTAT AATGCTCAAT ACATATCGTT ATATCATTCG TCTACTATTA 9900 
TCAGTTATTT TTATTTAATT TTAGTGTCAT TCTGH1ATTT TGATGTGGTG ATTTACCCAT T 9960 

TGTTGCCACA TCATCTGCAA TGTCAATTGG TATACGGTTC ATGTCTTGTA ATGCACTTAA 10020 

ATGGAATACT TCATCATCTA AATTTTCAAT GAGATATACA TAATATGTTA CCTTGTCCTT 10080

TTTATATTTT AACGTTTTCC AAAAGTCCGG CTTGCAATTC AATACATTAT CCGGAATATA 10140

TTCAATAAAT AAGTAACGTT TGCTGCCTAC TTTGTCTATG AAATATTTTG CAGTGCCTTT 10200

TTCTATACCT CTTATATGTG CATAGTCTGC TGAAAAGTAA ATACTACCTA TTGTTTCATT 10260

ATGTTGTTGT ATTTCAAATC GTTGGCCTAC TATTTTATTA TTTGTGCTAC nGGGGACTTA 10320 
(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

TCCCCCCTAA TTTTAGCATA ATCCATCTCC CGGCTAAACG AGAAnCAGCC TGTnAnnGGC 60 

GTGTGGATTG GATTTTAAAA TCACCCTCAT AAATACTGTC ATCAATATGA TAAGTTACAA 12 0 

TTTCACCTAT TATTAAATCA GCCCCATCTA ATACATCTCC AAGCAATATC ATTTGCGmTA 18 0 

GTTTACATTC GAATCTCATT TTCGCATCTT TAATTCCEG CGTCTTAATC GTTGTAGATG 24 0 

TTAAAAGTGA TAATTCTGTA CGACTCAACT CACTGTCACC ATATGCTAAC GGCGCTGCAG 300 

TCTCATTAAT ATCTTGAACA TTATCTTCGT CTGTAATATG CACAACAAAG TCTCCAGTCC 360 
GTTCTATATT TAATGCAGTA TCTTTTCTCT TACCTCCTGC ACGTTGAACT GCAATAGCAA 420

TCATTGGCGG ATGATTATTA ACAATATTAA AAAAGCTAAA TGGTGCTGCA TTTACTGATG 4 80 

CATCTTGATT TAATGTTGTA ACAAAAGCTA TAGGTCGTGG AATAATTGAA CCAATTAATA 54 0 

ATTTATAGTT TTCTCTAGCA GTTAATGATT GTGCATCAAA CGTATACATA ATACCTACCT 600 

CTTTTCTAAG TATATCTAGG TATTTCTCCG ATTTTGGTTA ATTTAAACAT CTATTCTCCT 660 

CTGAAAATCA CTTGTATTTA TTTAGCAAAT CTTTTGAAAT ATGACACATA TGCATATCTT 720 

CTGGATATTT TTCTAAATGT TGCTGATGTT CTTCAGCACT TTTAATGTAG TTAGACAGCG 7 80 

GTAAGACTTC CACTGCAATT TGATCTCTGT CTTTACGTCG TTCAiEGAAC TGACGCGCTT 84 0 

CAATTAAGTG GTCATCTACA CAACTATATA AACCCGTTCG ATACTTTTGT C C AAT AT CAT 900 

TTCCTTGTTG ATTCACACTG TAAGGATCAA TGATTTCAAA TAAATAATTC ATAATGTCTG 960

TAATTGTTAA CATACGATCA TCGAAATGAA GTTTGACACA TTCAGCATAA CCATCATACG 1020 

GACCGTCTAA TTTAGAGCTT CTTCCATTTG CTCTTCCTGC TTCTGTATGT AT AAT T C C AG 108 0 

GTATTGTTGC AAAAAATGCT TCAACACCCC ATAAACATCC TCCTGCTACA TAAACAACTG 114 0 

CCATATTTAC ACCTCATCAT CCTTTTTTAT ATTTTTAACA AGGTTATACC ATTTAATACC 1200 

GCCATGACAT GATTCTGATA CACCTTCATT ACGATACCCA TATTTTTCAT AAAATGAAAT 12 60 

TAATGATTCT CGACATGTTA AC G T T AC AC C ATGTCGATGA TGATTCTTAG CAAGAGTTTC 132 0 

AAAATAGTTT AGTAAGCGAC CTGCAATACC CTGACCTTGA TAATTTGGTG CTACAACAAG 1380 

ACCTAACACA CTAATATAGC CACCTTCACT ATTATTTGTG GAGACATTTT TAATAAATC 14 4 0 

ATCGCTAATG TAACGCTCTT TTATGACTGG ACCGTTG 14 77 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3976 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

AGG T GAT TAT CCTAAAAATG CTCATGAGGT CGCTATTAAT GATAAGT TAG CTGCAGACAA 60 

CATTAGAGTC GGGGATAGAT TACATTTTAA AAATAATTCA ACT AGT TATA GAGTTTCTGG 120 

TATTTTAAAC GACACAATGT ATGCGCATAG TTCCATTGTG CTATTGAACG AT AACGGAT T 180 

TAATGCATTG AATAAGGTTA ATACGGCATT TTATCCAGTG AAAAATTTAA CACAACAACA 2 40 

ACGTGATGAG CTTAATAAAA TAAATGACGT TCAAGTTGTG AG T G AAAAAG ATTTAACAGG 300 
TAATATTGCG AGTTATCAAG CAGAGCAAGC ACCGTTAAAT ATGATGATTG TTAGTTTGTT 360

TGCTATTACA GCAATCGTTC TAAGTGCATT TTTCTATGTT ATGACGATTC AAAAAATATC 4 20 

ACAAATTGGC ATTTTGAAAG CAATTGGTAT TAAGACAAGA CATTTATTGA GTGCGTTAGT 4 80 

TTTACAAATT TTAACACTAA CAATAATTGG GGTAGGTATT GCTGTGATCA TCAIX&TAGG 54 0 

ACTATCATTT ATGATGCCGG TAACGATGCC TTTTTACTTA ACAACGCAAA ATATTTTATT 600 

AATGGTGGGG ATATTTATAT TAGTAGCGAT TTTAGGTGCC TCACTATCAT TTATCAAATT 660 

ATTTAAAGTG GATCCTATCG AAGCAATTGG AGGTGCAGAA TAATGGCATT AGTCGTTGAA 720 

GATATCGTCA AAAATTTCGG AGAAGGTTTG TCTGAAACAA AAGT T T T AAA AGGTATTAAT 78 0 

TTTGAAGTGG AACAAGGGGA ATTTGTCATT TTAAATGGTG CCTCTGGTTC TGGGAAAACA 84 0 

ACATTGCTAA CGATATTAGG CGGATTGTTA AGTCAAACGA GTGGTACAGT GCTTTACAAT 900 

GATGCGCCAT TGTTTGATAA ACAGCATCGT CCTAGTGVTT TACGATTGGA AGATATTGGT 9 60 

TTTATTTTTC AATCTTCACA TTTAGTTCCT TATTTAAAAG TGATAGAGCA ATTGACACTC 1020 

GTAGGTCAAG AAGCGGGAAT GACCAAACAA CAAAGTTCAA CAAGAGCAAT ACAACTTTTG 1080 

AAAAATATTG GTTTAGAAGA TCGCTTGAAT GTATATCCGC ATCAGTTATC TGGCGGTGAA 114 0 

AAGCAACGTG TTGCGATTAT GAGAGCATTT ATGAATAATC CGAAAATCAT TTTAGCAGAT 1200 

GAGCCCACAG CAAGTTTAGA TGCCGATAGA GCAACAAAAG TTGTTGAGAT GATACGTCAA 12 60 

CAAATTAAAG AACAACAAAT GATTGGTATT ATGATTACAC ACGATCGAAG ATTATTTGAA 1320 

TATGCAGATC GAGTGATTGA ATTAGAAGAT GGCAAAATAA CTGATTAGTG GCTTGTAAAG 138 0 

ACGCTAAATG TTAATGATTT AAGACATAGT AG T AT AAAAG TTAGATAACA GAATACGATT 14 4 0 

TGGGTTTACA AAAAACAGGC TGGGACATTA AGTTCTTAGG CAATGTAAAA AAGCTGATTT 1500 

CTATTAATTA TTTGATAGAA ATCAGCTTTT TTGATATGTA TTTBTAATG TACAGCTCGT 1560 

TGCATTCATA TAGCTTGAAG TCACGTTTAA AACCATATCT ATCATTATGG TATGCATATC 1620 

TTTTAAAACC TATTCTTTTG TTATTAGGAC AT AT AAAT T C ATCATTAAGT TCGTCATATT 168 0 

TCCAATTTTG AGTGTTAAAA ATGTCACTTT TAAACTTTCT AGTTTTATCT TTAATAAACA 17 4 0 

TGACATACGT AATAAGTGGC GTTTTATTAA AATCATCTAT AATAGCCATA TAGTTTTGCT 1800 

CACTATCATA AC AT G C AT C A GCTACAATAT ACTCCGGTAA ATAACCGAAG nTATTTTgAA 18 60 

TCATTGTTAA AAATGGAATT AAAGTTCTAG TATCTGTTGG GTTTTGAAAT AGGTCATAGG 192 0 

ATAAAACAAA TTGAGAATTT GTCGCTATTT GTAAATTGTA TCCTGGCTTA AGTTGGCCAA 198 0 

AGTGTCTTAT TTTTTTAAAG TATTTAAAAG T AAAAT T AC A TGTTAATACG TAGTATTAAT 2040 

GGCGAGACTC CTGAGGGAGC AGTGCCAGTC GAAGaCAGGG GCCCCAACAC AGAArcTGAC 2100 

ATATAGTCAG CTTACAACAA TGTGCCGGTT GGGGTGGCTG AGACGGCACC CAGGAAGGG 2160

ACCCGTCATC AAAAATTCTA TTTATAGAAT TTTACAGTAA TGTGCCAGAT GGGCATAGCG 2220

AAgcCATTCA ATACGAAGTA TTGTATAAAT A G AG AAC AG C AGTAAGATAT TTTCTAATTG 2280 

AAAATTATTT TACTGCTGTT TTTTTTAGGG ATTAATGTCC CAGACTCTTT AGTTTATTTA 2 34 0 

TTTTCAATAT AACAATTGTC TAATCAAGGA TTAACGAATA TTTAAAGATA GTTTGACGCA 2 4 00 

AT AT TAG AAA CAACCTATAA TAATAGTTTG TTTGTGGATT AACTATTATA AATAAAAGCG 2 4 60 

GCGTAAAGAC ATATAAACCA AC T AC T T G AA CAATATAACG TTAATAACAA TCTATACTGA 252 0 

TACATTACGC CTAGATAATC TTTGATGAGC ACA'GTAAGA AAAAGTGATA TGGTGTATGA 258 0 

CTTCCGACAC CATCGATAGA TAAACCTAAT TTTTGGGCTA GTCGTAAGGC GCGCAATACA 2 64 0 

TGAAACTGAC TTGTtACACA AACAATTTTA ACTGCTTCAT GATACAAATT GTTGATGATT 2 7 00 

TGTTTAGAAT ATAAAAAGTT TGTGTATGTA TTTATAGAGT GAGATTCCAT TAGTATATE 27 60 

GTTTTATCAA CACCATGTGC AATCAAATAA CGTTGCATAG CTAAAGCTTC AGAAATTGGT 2 82 0 

TCGTCTGGTC CTTGTCCGCC AG AT AC AAT G ATCTTTGTTG CTGATGCTTG TTGTTGATAG 2880 

ATATCAAGTG CACGATCTAA ACGCGCTGCA AGCATTGGTG TGACAAATTC GGTAAAAATA 2 94 0 

CCAGCACCTA ACACAATTAT GATATCAACT TCTTTGTTGT ATGATCTATG TCTATATGAT 3000 

ACTGTCCAAA C GAG AT AAC A AATAAAGGTT AGTAACAGGG AAAGAC AT AA TATAGCTAAC 30 60 

CACATAGACA AACCTTTCAC AATAGGTGAC TGAATCGTAC T T AT AAAT AG AAGTGCTGAT 312 0 

GTGTAGAGTA CAAATTTATA TGAAAAAGAT AATAATTTTT TAATAAATAA GCGACTAGAA 318 0 

GTATGAGAAA ATAAATATCT ATGTTTGAAT AGC AT G AT AA TACTGATTAT TATAAATGTT 324 0 

ACAAACATAG ACCAAGGGAA AGTATAGGTC ATGATGCTAT AGATGAGTGA C AAAAAT AT C 3300 

GATATGACAA CTAAGATGTA GCATGTTAAA TTTAACGTCA GAGTATAGTT GAAAATTAAC 3360

GGACAAATAA CGATAAGTAT AAATATTAAT AATAAATTCA ATAACATACT GACACCTCGC 3 4 20 

T TAT AAT AAA TATTAAATAT AAATGTAGAT GATTTAATTT ATTAAAGCAA GGAGAAAGCA 34 80 

GCAACATGTA AATCTTAATT TGTTATATTA TATATGGGTC AATATTTTTG TGTTTTTTAG 354 0 

ATGTGCGATA CAATGATTGC TGU3GATACT T TAT AAAAAT AGTAATTAGG GAGAAATAAT 3600 

TATGGTAAAA CATTTACAAG ACCATATTCA ATTTTTAGAG CAGTTTATAA ATAACGTTAA 3 660 

CGCATTAACT GCAAAAATGT TGAAAGATTT AC AAAAT G AA TATGAAATTT CATTAGAGCA 3720 

GTCTAACGTA TTAGGTATGT TAAATAAAGA ACCTTTGACA ATTAGTGAA TCACGCAAAG 37 80 

ACAAGGTGTA AATAAGGCCG CAGTAAGCCG AC G AA T T AAA AAGTTAATCG ATGCTTAATT 384 0 

AGTTAAGTTA GATAAACCAA ATTTAAATAT TGATCAACGT TTGAAATTCA TAACCTTAAC 3 900 

TGACAAAGGT AgAGCATATT TGAAAGAACG TAATGCGATT ATGACAGATA TTGCGCAAGA 3960

TATTACTAAT GATTTA 3976

(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3346 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

GCTACCTAGG CATTTAAGAG ATCAAAAAAT GTATGAATAT GAACGTTATT TTTATGAGCA 60 

AGAACTTAAT GGCGTTGATG aAGGGGAAAT TTTAAAGAAG TTAAAAGACC CACAAGATGT 120 

TGCAGCTGAA ACAAAAGCTA GAAG13TTAT TGATTATGCT GAATCTAAAC CAACATTTGA 18 0 

AAATATTTCA AGAGCTGTTG CTGCTTCATT AAGTTTAGGC ATTCTATCTA TTTTTGTCAT 24 0 

CCTTATACCA GTATCTATAG TTGGATTATT TGTATTAGCA TTATTTTTAA TATCACTTTT 300 

GCTGCTGTTT TGTCCAATTA TTTTATTAGC ATCAGCAATA TCCAGAGGAATTGTGGACTC 360 

AATTAGTAAT GTATTTTTTG CCATATCATA TTCAGGATTA GGATTAGTAT TTATCATTGT 420 

CATATTTAAG ATTTTAGAAT ACATTTATCG TTTAATCTTA AAATATTTAC TTTGGTATAT 4 80 

TAAAACTGTC AAAGGAAGCG TTAGAAAATG AAGAAATTCT TTTTTATTGG GCTTTTAGTG 54 0 

TTTGTTGTCT TTTTTACAGC AGCAACCATT ATTTGGTTCA GCTATGATAA AAACAAATAT 600 

GGTACTAAAC AATATGATAA AAC AT T C AAA gACGATGCTT TTGACAATGT ATCTATAAAT 6 60 

TTGGATAGTA CAGAACTTCG TATAAAACGG GGGAATCAAT TTAGAGTTAA ATATGATGGT 7 20 

GACAATGATA TATTAATTAA TATAGTAGAT AiSACGTTGA AGATTAGTGA TAAAAGGTCT 78 0 

AAGACAAGAG GATATGCAAT T GAT AT G AAT CCTTTTCATG AGAATAAGAA AACGTTAACG 840 

ATTGAAATGC CTGATAAAAT GATTAAACGT TTAAATCTAT CATCTGGAGC AGGAAGTGTT 900 

AGAATCAGTG ATGTTGATTT AGAGAACACA AGTATTCAAA GCATTAACGG TGAAGTAET 960 

ATTAAAAACT CAAATCTTGA CGCACTAGAT TCAAAGACAA ATAATAGTTC TACTTATATT 1020 

AG T AAAAG T A AC AT T AAAAA TAGCAATATT AAAGTTGTTA TTGGTACGCT ACAAATCGAC 1080 

AA G AG T C AAA TTAAACAATC CATATTTTTA AACGATCATG GTGACATTGA ATTTAAAAAC 114 0 

ATGCCATCAA AAGTAGATGC AAAAGCTTCT ACTAAACAAG GAG AT AT T C G TTTTAAGTAT 1200 

GATAGTAAAC CTGAAGACAC TAT AC T AAAG CTAAATCCGG GAACGGGTGA TAGCGTAGTT 12 60 

AAAAATAAAA CATTTACTAA TGGtAAAGTT GGGAAAAGCG ACAATGTTTT AGAATTTTAT 132 0 

AC GATTGATG GTAATATCAA AGTTGAATAA ATAAAGGATG TAAGCACCGA TAT TAG G AAG 1380 

CATAATTTCT CTAATATCGG TGTTATTTAT TTGTTGGCAA AAGTTAAGTC GGTATCTATA 14 4 0 






TTGCCAGTAA AGTGAGTGAT ATTAAGGTCT TGACCATCTA AC CAT GAT T T GAAATCTATT 1500 

ATTTCTGGTG GCGCATTTTC TCCCAATGTA AAATATGCAG TTAATGTTTC AGGTTGATAC 1560

ATTGATGTAT GGATGGTGCC AGACCAGCTT TTGAATAGTT TACTGTAAAT TTCATACTGA 1620 

GGATTATTGA ATAACTTAAA TGCTGTAGTC ATATCTAAAT TATCATTAGT TTGTGAAATG 168 0 

GTACGCGCCA GTCTTTCTTT AGATTCTTTT GTATAATTAC GATTTTCATG TGTTAATATT 1740 

TCAAAATGAT TTGTACATAT ATTATCATAA CGAACATCTA TTGATCTCGG TGTCACTTCA 1800 

AC AAT T G CAT GGTTCAATGA TTTGTCCATC AGTATGTAGC TAAATGAGCT TCTGTGTGGT 18 60 

ATTTCTTTCA ATAAT TGGAT TGCTTCTGTT ACATTTCGGC AATTTTCAAG AAT T AGACGA 192 0 

CCAATCATAT AACATACAAA ACCATTTGCT GGTTTCTTCC GGTGCAffiAA GTTATAGCCC 198 0 

ATAGTTAATC CTGACTCATT CATACCATCC ATTCTTCCAG TTACCCTTGA TACAGGACCA 2 04 0 

ATTTGAGCTA AACCGCTATC TGTAGGTTGA TAAAGTAAGT AGCGACCATC ATAAGTTGCA 2100 

GGGTGGTAAT CAT AAT T T C T AACCATGAAG TCTTTGCCTT GAAAGACCGT GCAaCCACTT 2160 

TCTTTTAAAT CGGTAAAACG ATAATGTCCA AAGTTTAAAA TAATTTGGCG TGTTGGCATT 2220 

TTGAGTATAC TTTGTAGTCC CATTAATTCT TCCCATATTT GAGGTGCGTA TGTTTGGAAT 2280 

ATTTGATAAG TTTCATTTAC ATCTATATCG AAACGTGGGA CaCnTTTTTT CCATTCTTTT 2 34 0 

TCTCGATTTT TTAGAAGAGG TGTTTGTTGv AGCCATTTAC CAGTTTTAAC ACCTAACTCG 24 00 

AAATGTGAAC CTCTAAAAGT CATGATATCT GATGTCACTT GTTGCATATC ATCGGCCCCT 2 4 60 

TTCTTTTTAG TTGTAATATA TTGTAAATAA ATAGTAATCG TAT G TAT ATT GAATGTCATG 2520 

TTAAATAAAG TTATATTTTA CTAAATGAAA TATAAAATTG TTTGAGGTGA TTTTCCGGTG 2 58 0 

TATAAGACTT ATCAATCAGT TAAAACATAT TTTTATAGAT GGTGGGGATA TTGAGTTAAA 2 64 0 

AACTTAAAAT CATCTTATCA TAAATATCAA TCTTAAGTTA GCATTCACGA TAATAGTCAT 2700 

TGTTAACATT AGCATATAAG GTCATGTCAC GTTGAAACAG AGGTTCCTCG GCATTTTTGA 27 60 

AAATAATCAT TAGCTTGCTC ACT AT AT AAT T T GAT G AAT A TATTTCGTGA AAGTGGGTAT 2 8 20 

TTATTTAATG ATTATTCTAT ATATGATAGT ATAATGAAAT GTAGATAGGT ATTTAATTTA 28 8 0 

ACAGAGGTGA AATTGAGATG TGGAATTTTA TTAAATGtGT GkTTAAATTC GTATTTAGCT 2 94 0 

TAGTTGCTAT TACAACATTA GTTGCTGGTG TTGGTCTAGT AGCATTTGCT TATATCTTTA 3000 

AAAAAGATTT TGAAGATATT GAAAGAAAAA CTAAAGAAAT TATTTCTGAT ATTGAAAGTA 30 60 

AAAATAACTA ATAACATTTA GAGGCTGGGA CATAAATCCC TAAAAAACAG CAGTAAGATA 3120 

ATTTTCAATT AG AAAAT AT C TTACTGCTGT TCTCTATTTn ATcAmTACTt CGTATTGAAT 3180 

GGCTTCGCTT TCCTAGGGTG CCGTCTCAGC CTTGGTCTTC GACTGGCACT GCTCCCTCAG 3240

GAGTCTCGCC ATTAATACTA CGTATTAACA TGTAATTTTA CTTTGGAAAT ACTTTTAAAA 3300

AATAAGACAC TTTGGCCCAA CTTGGCACAT AAATGTAAAA TTCAAT 3346



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GTTGAAGAAA GAAA.TATAAC AGTCAATTAT AATTATAACC TTGTTGAAAT CGACGGTGAC 60 

AAAAAAGTGG CTACATTCGA AC AT AT C AAA GCATACGATA GAAAAACAAT AAGTTATGAT 120 

ATGTTACATG TAACACCACC TATGGGTCCC T T AG AT GTAG TAAAAGAAAG TACACTTTCA 180 

GATAGTGAGG GTTGGGTAGA TGTTAACCCA ACCACATTS AGCATAAAAG CTACTCTAAT 24 0 

GTATTTGCAC TTGGTGATGC TTCAAATGTA CCTACTTCAA AAACAGGCGC ACTATTcGTA 300 

AGCAAGCACC TATCGTCGCT AATAATTTAT TGCAAGTGAT GAATAATCAA ATGTTAACGC 3 60 

ATCATTATGA TGGTTATACT TCATGCCCTA TTGTTACTGG ATATAATAGG TTAATACTTG 420 

CAGAGTTTGA TTATAATAAA AATACTAAAG AAACAATGCC GTTTAATCAG GCCAAAGAAC 4 80 

GTaGAAGTAT GTATATATTT AAGAAAGATT TATTACCTAA AATGTATTGG TACGGCATGC 54 0 

T AAAAGG AT T AATATAATAA AG T AC AG AAA ACAATAAATT T T TAATG AAA AATCTTTTAC 600 

TATAAAAGAT TAAGTATTTA AATGACGTGT CAGTGTTGTG TTTATATGTC GTGAATTTTT 660 

AGCTCTAAAT AGTATAAGAT TGAAAAAGTT GTTACTGTTT TAAAT GATC A CGATGAAGTC 720 

AT TCAAT AAG AATGATTATG AAAATAGAAA CAGCAGTAAG ATATTTTCTA ATTGAAAATC 780 

ATCTCACTGC TGTTTTTTAA AGGTTTATAC CTCATCCTCT AAATTSTTA AAAATAATTA 84 0 

ATGGTATTTG AGCACGTTTA GCGACTTTAT GACTGACATT ACCAATTTCC ATTTCTTGCC 900 

AG AT AT T C AA ACCACGTGTA CTCAAAATGA TAGCTTGGTA TGTACCTCCA ATAGTAATTT 960 

CAATAACTTT GTCTGTTGAA CACTAAGAGC AATTTTAATT TCATAATGTG TTGTAAACAT 1020 

TTTTTTTGAT TGGAGTTTTT TTCTGAGTTA AACGATATCC TGATGTATTT TTAATTTTGC 108 0 

ACCATTTCCA AAAGGATAAG T G AC AT AAG T AAAAAGGCAT CATCGGGAGT TATCCTATCA 114 0 

GGAAAACCAA GATAATACCT AAG T AG AAAG TGTTCAATCC GTGTTAAATT GGGAAATATC 1200 

ATCCATAAAC TTTATTACTC ATACTATiAT TCAAT TTTAA CGTCTTCGTC CATTTGGGCT 1260 

TCAAATTCAT CGAGTAGTGC TCGTGCTTCT GCAATTGATT GTGTGTTCAT CAATTGATGT 1320 

CGAAGTTCGC TAGCGCCTCT TATGCCACGC ACATAGATTT TAAAGAATCT ACGCAArCTC 1380

TTGAATTGTC GTATTTCATC TTTyTCATAT TTGTTAAACA ATGATArATG CAACTCAAy 1440

ArATCTAATA GTTCyTTGCT TGTGTGTTCG CGTGGTTCTT TTTCAAAAGT GAATGGATTG 1500 

TGGAAAATGC CTCTACCAAT CATGATGCCA TCAATACCAT ATTTTTCTGC AAGTTCAAGT 15 60 

CCTGTTTTTC TATCGGGAAT ATCATCGTTA ATTGTTAACA ATGTGTTTGG TGCAATTTCG 1620 

TCACGTAAAT TTTTAATAGC TTCGATTAAT TCCCAATGTG CATCTACTTT ACTCATGCGT 1680 

TTGATAAAAA CTTAAATAAT ATTAATTCGG TCATCAGTGG CGTTAAATCT TTTATCATTT 1740 

TTAGTTATAG TTGATAAATT TATATTTATA AGCATATATG GATATTTCAT CAAAAATTTT 1800 

TATTTATATA AATCCGAACT GCATACATAT TTGTTAAAT AAGAGGTATT ATTTTTCGGG 18 60 

AAATTGCTGT CTGAGTTAAA AGGATTAGTT TTATAAAATG AGTTGAACTA TAGCCAAAAA 1920 

CGATTAAAAT ACTGATAATC CATTTTTGtA TTATGTTAGG GACTTTTTTA CTTAATTTTA 1980 

ACCCTATTGG aGCmAATATA ATACTCCCTA TTATAAGGAA TAAGGCGTCA TATAAaGGGA 2 04 0 

TATAACCTTG AATAAGTTTG ATGACAAAAG CACCAATTGA AGATATAAAA GCAATTACTA 2100 

TACTATTAGC GACTACAGTA TTCATTGGTA ATTTGAATAA AACCAATAAT ATAGGAATAA 2160 

TAATGAAGGC ACCACCTGCA CCTACTATAC CTGAAATAAT ACCAATGAAA AGGCCAATGA 2220 

TAACTAATAA ATATTTATTA AATGAAGACT TTTCGGAACT AGGTTtCACT TTAATAAACA 2280 

TTAATGTTAA TGCAAGTAAA GCAATAATGA TATATACCGT ATT T AC AAAT GTAGCATCAA 234 0 

ATAAATTTGC TAGAAATGCA CCTAACATAC TCCCT 2375

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

GAGGTTTCTA GACAAGCTTT TAATAACTTA CCAAACTIAT TAAgrTGGTT gTGtTGGACT 60 

GCCtATTATC mAAGtATTAT GaGTTGTTTA ATATTAGtGC TAArACATAC GAAGAGTGGT 120 

TTAAACAATT TAG TAGT AAG AAAGCACAAT TCAGTATTAA TCTCACGGAT AAATGGATAA 180 

TTCAAATCGC ATATGGTAAA TTAATAATAA TGGCTAAAAA TAATGGCGAT ACATATTTTA 240 

GAGTTCAAAC AATTAAAAAG CCAGGTAATT ATATTTTTAA CAAATATCGA T T AGAGAT AC 300 

ATTCTAATTT ACCAAAATGT TTATTTCCGC TTACAGTGAG AACACGACAA AGTGGCGATA 360 

CATTTAAACT GAATGGGCGC GATGGTTATA AGAAAGTGAA TCGCCTGTTT ATAGATTGTA 420

AAGTGCCACA GTGGGTTCGG GATCAAATGC CAATCGTATT GGATAAACAA CAGCGCATTA 480

TTGCGGTAGG AGATTTATAT CAACAACAAA CAATAAAAAA ATGGATTATA AT TAG T AAAA 54 0 

ATGGAGATGA ATAGCGTTAT GCATAATGAT TTGAAAGAAG TATTGTTAAC TGAAGAAGAT 600 

ATTCAAAATA TCTGTAAGGA ATTGGGAGCA CAATTAACAA AGGATATCA AGGTAAACCA 660 

TTAGTATGCG TGGGTATCTT AAAAGGCTCA GCAATGTTTA TGTCAGATTT AATTAAACGA 7 20 

AT T GAT AC C C ATTTATCAAT TGATTTCATG GATGTTTCTA GTTATCACGG AGGCACTGAG 7 80 

TCAACTGGTG AAGTTCAAAT CATTAAAGAT TTAGGTTCTT CTATTGAAAA TAAAGACGTA 8 40 

TTAATTATTG AAGATATCTT AGAGACTGGT ACTACACTTA AGTCAATTAC TGAATTATTA 900 

CAATCTAGAA AAGTTAATTC ATTAGAAATA GTTACTTTAT TAGATAAACC AAACCGTCGT 9 60 

AAAGCGGACA TTGAAGCTAA GTATGTAGGT AAAAAAATAC CAGATGaATT TGTTGTTGGt 10 2 0 

TACGGTTTAG ATTATCGTGA ATTATACCGA AACTTACCAT ATATCGGTAC GTTAAAACCT 1080 

GAAGTGTATT CAAATTAATT TTTTAATCAA TTTCAGTTAT TAT T AC TAT G CGTTTGAGAA 114 0 

ATAATAGTGT AGACTCAAAA AT AT G AAAAA TGTATTTCAT AT AT AT T T AA TTTTAGACAA 12 00 

G AC AT AT G T C TTGAAAAGTT GAAAAATATA GAGATTGATA AAACTAATAC GGTGTGAAT 1260 

GACATTGATG TTAAGCTCAA TTACTAGCTT ATAAAACATG TCATATGTTA CAATTTTTGT 1320 

TAGTTTTATT ATGGGAAGTA GGAGGAAATG ACGCATGCAG AAAGCTTTTC GCAATGTGCT 1380 

AGTTATCGTA ATAATAGGCG TTATTATTTT TGGTCTATTT TCATATTTAA ACGGTAATGG 14 4 0 

AAATATGCCG AAACAGCTTA CATATAATCA ATTTACTGAG AAGTTGGAAA AAGGTGACCT 15 00 

TAAAACTTTA GAAATCCAAC CACAACAAAA TGTCTATATG GTAAGTGGTA AAACG AAAAA 15 60 

TGATGAAGAC TATTCATCAA CTATTTTATA TAACAACGAA AAAGAATTAC AAAAAATTAC 1620 

TGATGCTGCT AAAAAGCAAA ACGGTGTAAA ATTACGATT AAAGAAGAAG AAAAA CAAAG 168 0 

TGTCTTTGTG AGTATACTTT CAACATTAAT TCCAGTTGTA GTCATAGCGT TATTATTTAT 17 4 0 

TTTCTTCCTA AGCCAAGCAC AAGGTGGCGG TAGTGGCGGT CGTATGATGA ACTTTGGTAA 18 00 

ATCTAAAGCA AAAATGTACG ATAATAATAA ACGTCGTGTT CGTTTCTCTG ATGTAGCAGG 1860 

G G C AG AT G AA GAAAAACAAG AATTAATTGA AATTGTTGAT TTCTTGAAAG AT AAT AAAAA 1920 

ATTCAAAGAA ATGGGATCTA GGATTCCTAA AGGTGTCTTA CTTGTTGGAC CTCCAGGTAC 1980 

TGGTAAAACA TTACTTGCTA GAGCGGTTGC AGGTGAAGCT GGCGCACCAT TCTTCTCTAT 2040 

TAGTGGTTCA GACTTTGTAG AGATGTTTGT TGGTGTTGGT GCGAGCCGTG TTCGTGACTT 2100 

ATTCGATAAT GCTAAGAAAA ACGCGCCTTG TATCATCTTT ATCGATGAGA TTGATGCTGT 2160 

TGGTCGTCAA CGTGGTGCAG GTGTTGGTGG CGGTCATGAT GAACGTGAAC AAACCCTAAA 22 2 0 

CCAATTATTA GTTGAAATGG ATGGTTTCGG TGAAAATGAA GTATCATTA TGATAGCTGC 2280

TACAAACCGT CCTGATATCC TTGACCCAGC CTTATTACGT CCAGGTCGTT TTGATAGACA 2340

AATTCAAGTT GGTCGTCCAG ATGTGAAAGG CCGTGAAGCA ATTCTTCATG TTCATGCTAA 2 4 00 

AAACAAACCA CTTGATGAAA CGGTTGATTT AAAAGCAATT TCACAACGTA CACCTGGTTT 2460

CTCAGGTGCT GATTTAGAGA ACTTATTAAA TGAAGCATCT TTAATTGCTG TACGTGAAGG 2520 

TAAAAAGAAA ATT G AC AT G A GAGATATCGA AGAGGCAACG GATAGAGTTA TAGCCGGACC 2580 

TGCTAAGAAA TCTCGAGTTA TTTCTAAGAA AGAACGTAAT ATTGTTGCTC ATCACGAAGC 2 64 0 

TGGTCATACA ATTATCGGTA TGGTACTTGA TGAGGCAGAA GTAGTGCATA AAGTTACTAT 27 00 

TGTTCCACGT GGACAAGCAG GTGGTTATGC AATGATGCTA CCTAAACAAG ATCGTTTCTT 27 60 

AATGACTGAA CAAGAGTTAT TAGATAAAAT CTGTGGTTTA CTTGGTGGAC GTGTATCAGA 28 20 

AGATATTAAC . TTTAACGAAG TATCAACAGG TGCTTCAAAT GAGTTCGAfi GTGCAACACA 2 88 0 

AATCGCACGC TCAATGGTTA CGCAATATGG TAT GAG T AAA AAATTAGGAC CAT TACAGTT 2 94 0 

CGGTCATAGC AATGGTCAAG TATTCTTAGG TAAAGATATG CAAGGTGAGC CTAATTATTC 3000 

AAGCCAAATC GCATATGAAA TTGATAAAGA AGTTCAACGA ATCGTTAAAG AACAATACGA 30 60 

ACGTTGTAAA CAAATTTTAT TAGAGCACAA AGAACAATTA ATTTTAATTG CTGAAACATT 3120 

ATTAACAGAA GAAACATTAG TTGCTGAACA AAT TCAATCA TTATTCTACG AAGGTAAATT 3180 

ACCTGAAATT GAT TAT GAT G CAGCTAAAGT TGTTAAAGAT GAAGATTCTG AATTTAATGA 324 0 

TGGTAAATTC GGTAAATCTT ATGAAGAGAT TCGTAAAGAG CAATTAGAAG ATGGACAACG 3300 

TGACGAAAGT GAAGATCGTA AAGAAGAAAA AGATATTGCT GAGGATAAAA AAGAAGCTGA 33 60 

TAAATCTGAT G AAAAAG AT G AACCAGCACA TCGACAAGCC CCAAATATCG AAAAACCTTA 34 20 

CGATCCAAAT CACCCAGACA ATAAATAATC GATTATATTC AGTACCTCTT TCTATSTAA 34 8 0 

AG T T AT AG AA AGAGGTACTT TTATCGTTTT TGAAAATACG TATTAGATTT TAAGTCGTTG 35 4 0 

AATTGTTATA G C AG AAAAT A ATTGTAAAAC AAGTTACTTC ATTATTTAGA ATGATGGGTG 3600 

TAGAATAAGT ACAATTGTTG CAT T T TAT GA AGTAAAGTAA TTTTTTAAAT ATAGAGTAAT 3 6 60 

AGAGGAGATT G AAA.T AAT G A CACACGATTA TATTGTTAAA GCATTAGCAT TTGATGGAGA 37 2 0 

GATTAGGGCT TATGCTGCTT TGACAACTGA AACTGTTCAA GAAGCACAAA C GAG AC AT T A 37 8 0 

TACATGGCCG ACAGCATCTG CTGCAATGGG AAGAACAATG caCAGCAACA GCTATGATGG 38 4 0 

GCGCAATGTT GAAAGGTGAT CAAAAATTAA CTGTCACET AGATGGCCAA GGACCTATTG 3 900 

GACGAATTAT TGCCGATGCA AATGCTAAAG GCGAGGTGCG TGCTTATGTA GACCATCCAC 3 9 60 

AAACTCATTT TCCATTAAAT GAGCAAGGTA AACTTGATGT AAGACGAGCG GTAGGGACAA 4020 

ATGGATCTAT TATGGTTGTT AAAGACGTTG GAATGAAAGA CTATTTCtCT GGAGCAAGTC 4080

CaATTGTTTC AGGAGAACTT GGTGAAGATT TTACTTATTA TTATGCTACA AGTGAACAAA 4140

CACCTTCATC GGTAGGTCTT GGTGTATTGG TAAATCCTGA TAATACGATT AAAGCAGCAG 4 2 00 

GAGGATTTAT CATTCAAGTT ATGCCAGGTG CCAAAGATGA AACAATTTCA AAAT TAG AAA 4 2 60 

AAGCAAT TAG TGAAATGACA CCAGTTTCTA AATTAATTGA ACAAGGATTA ACGCCAGAAG 4 320 

GAT TACT AAA CGAAATCTTA GGTGAAGACC ATGTGCAAAT T T T AG AG AAA ATGCCTGTTC 4 38 0 

AATTTGAATG TAATTGTAGT CAT GAG AAAT TTTTAAATGC TAT TAAAGGA TTGGGCGAGG 4 440 

CTGAGATTCA AAATATGATT AAAGAAG AT C ATGGTGCTGA AGCAQ3ATGT CATTTCTGTG 4 500 

G AAAT AAAT A T AAAT AT ACT GAAGAAGAAT TAAACGTGTT GCTAGAAAGT TTAGCGTAAT 4 5 60 

TTAATTTAAA TCAATACGCT AAAATGTTTA TTTTTAGCGG TTTAGTGAAA T G TAG AAC T A 4 62 0 

AATAGTTGTA TAATCCTTAG TGATTTTGTT TGCTTTCTAG AATTTATTTG ATAAAATAAT 4 68 0 

TCTATATCCG ATAAATAAAC TAAGATTTCA ACAACTAACT AAAAAGG AG T GTTCTTAATG 4740 

GCACAAAAAC CAGTAGATAA TATTACTCAA ATTATTGGCG GTACACCGGT AGTCAAATTG 4800 

AGAAATGTAG TAGATGACAA TGCAGCAGAT GTTTATGTAA AATTGGAATA TCAAAATCCA 4 8 60 

GGTGGTTCTG TAAAGGATAG AATTGCTTTA GCAATGATTG AAAAAGCAGA GCGAGAAGGC 4 920 

AAAATTAAAC CTGGCGATAC AATTGTAGAA CCAACAAGTG GTAATACAGG TATCGGTTTA 4 980 

GCATTTGTAT GTGCTGCTAA AGGATATAAA GCAGTATTTA CTATGCCCGA AACAATGAGC 504 0 

CAAGAGCGTC GTAATTTATT AAAAGCATAC GGTGCGGAAT TAGTTTTAAC GCTGGATCA 5100 

GAAGCGATGA AAGGTGCAAT TAAAAAAGCT AAAGAATTGA AAGAAGAACA TGGTTACTTC 5160 

GAGCCACAAC AATTTGAAAA CCCTGCGAAC CCTGAAGTTC ATGAGTTAAC TACAGGTCCT 5220 

GAGTTATTAC AACAATTTGA AGGGAAAACT ATCGATGCGT TCCTAGCTGG TGTTGGTACT 52 8 0 

GGTGGTACGT TATCTGGTGT AGGTAAAGTT CTGAAAAAAG AATATCCTaA CATCGAAATT 534 0 

GTTGCTATAG AGCCTGAGGC TTCTCCAGTA TTGAGCGGTG GTGAGCCAGG T C C AC AT AAA 54 00 

TTACAAGGTT TAGGTGCTGG ATTTATTCCA GGCACTTTGA ATACAGAAAT CTATGACAGT 54 60 

AT TAT T AAAG TAGGAAATGA TACAGCGATG GAAffGTCTC GTCGAGTTGC TAAAGAGGAA 5520 

GGTATTTTAG CAGGTATTTC ATCAGGTGCT GCGATTTATG CTGCCATTCA AAAAGCAAAA 558 0 

GAATTAGGAA AAGGTAAAAC AGTAGTAACA GTATTGCCGA GTAATGGTGA ACGCTACTTA 564 0 

T C AAC AC C T T TATATTCATT CGATGACTAA TTAATGTCAT TTAAAAGAGT GAGTTATCTT 57 00 

TTTGAGATAA CTTGCTCTTT TTTTCTACCA TGTATATTTT TAAAAATATG AGCGTTAAAT 57 60 

TAAACATTTT TCTGATAAAA ATATCCAGTG AATGATAAGA TAATAAACGT ACATACTAAT 582 0 

AACTAGTAAA TAGCAGGAGT AAAT TTT ATT AG AG T T AAAC AATACATAAT TAAAGGGTGG 58 80 

TTAACATGAC TAAAACAAAA ATTATGGGcA TATTAAACGT CACACCTGAT TcATTCTcAG 5940

ATGGTGGAAA ATTTAATAAT GTTGAATCAG CTATAAATAG aGTGAAAGCC ATGATAGATG 6000

AAGGTGCTGA CATTATAGAT GTTGGAGGTG TTTCAACGAG ACCCGGTCAT GAAATGGTTT 6060 

CATTAGAAGA TGAGATGAAC AGAGTATTAC CTGTTGTTGA SCTATTGTC GGTTT 6115

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

TAGATACTGG GnTAAAcaTc AAAAATAtyT GCtTaTTCaC GTGTTTAcGc TCCCtCAAAC 60 

GCAACGTTAA TTGCGTGTAA TCATTTAGTG TGAATTcAGA CGCTTCTTCC ATGACTATGT 120 

CTGATATGCC TTTTATCGAC TTTATTTTCT CTGGGTTATC TAATCCTTTA AACAAAAAAA 180 

CTGCGCCGTT TGGCAATTCA ACTTTGTTAT CAGTCTTATT CCAAAGGCAC ATGTCCCAAA 24 0 

TACCAAAGTT TATCAAACAA TCTTTAACAT CTTCGAACAA ACTATCTTTA ATTGTTGATT 300 

GTACTTTTCT AAGCCACAGT AT AC GC C TAG GATATTTCCA ATCTTGCAAT GCTTTGAGTA 3 60 

CAACTTTTTG TATAACGCCG TGAGACTTAC CGCTCGAACC TCCACCGTAA TGkACTTCAG 420 

TGAAGTtATC GTAATTGGTT AGTATTTCGA ATATGTTTCT ATTGAAAACA TTAGACGGTT 4 80 

TGTTAAAGTT TAATTTAACT TTCGTCATCG T AC T C AC C AA TATTAATCTC AATATTCTTC 54 0 

TGAGTAATTT CTTTTTTATC GATATACGCA CCATGTACTT TTBTATGTG GTCAATAGAT 600 

CTCTGACGCT CTTCAAAAGT TGGTGTGATT GTGTAAGTAA CCTCTTTTTC CACTTCATCG 660 

TTTAAATGGT CATATTTCTT ACTGTAAGCC TCTTGAGGTT CTCCTCTAGC AATAGAAGCA 720 

GATAACGCTA AAGCTTCTGT AATACTCATT AAACGCTCTT CTTGTATCTG TTCTAATCGT 78 0 

TCTTTAATAT ATTCCGAAAC ATTAACATTT CTTAACAATC GACTTGCTAA AGACTCTGCT 84 0 

GTTTTCTTAC TATAACCTGC TGTAATTGCT GCTTTTTTAC CATTACATCC ATTCATTATA 900 

TATTCATCTG CGAATCTCTT TTGTTTTTCG TTCATTTCAT TTACCACCAA CTCTCGCGCT 9 60 

ATACGCTTTT TAAAATTAAA AAAGGVTTGG CTATAATCAG CCAACCCACA TAGATCCTTT 102 0 

ATTCCTAATT GCGATAAGGG AAACGCAGTA CGATAGTCAA TATCCTACAC TATCATAATA 108 0 

TCTCATTTAA GGTATCAAAA ACTGCCACTT TACTGCCAAT TTCAGTCTTC CCCTAACTCT 114 0 

TCCGCCAATC TAGATATGAT TTTTCTTTTG ATTCTATGAG CAGTTCTATCAGAAATGTGT 1200 

ATGTCAACAC AAACTTTCAC TAATTCCTTT TTATTAAAAT AATACTCTTG AATGAATTCG 1260

CGTTCTTTCC TGCTTGATGT GTTGATTATA CGTTCAATAG CGCTCTTAAA CTCAAGGATT 1320

TTACCTCTTC GTATACTACA AAGATAATTA GTTACTGCCA TTTCTGTTTT CGATGTATTA 1380 

GACGGTACAA ACTCCCCGCC TATATTTGTA TCTGTTGGAA TCCACGGTGT CATTATTTCA 14 4 0 

CTTCTTAAAT CTTCAAGTTG T T T AT GAT AA TTAGGATAAT CACACAACTC ATCTTCTAAC 1500 

TTTCGAACTG TTGATAATTT TAATCCGTAT TTCTTTTTAG TCATGAATAC CCTCCGTACA 1560 

AATATGTTTA ATCTTCAAAG TGTCTCAATC TA:TTCTTAA TATCTCTATC TCTCGCTCTT 162 0 

TAACTTTTAC ATCACCTTTT AACTGTTCCG CTTGTAACAT CACACCAAAC AATAAGATGA 168 0 

CTAGTAATAT AATTGCTATG ATTAACCACA TCATCTACTC CGACACCTCC GCCCTCATCA 17 4 0 

AATCAGACTG AT C AC T C AAC TTTGCGAAGT CACTTGGCGC CTCTACATCA TCATTAGD3 18 00 

TCATCATAAT ATATACTTGC TCAGTTACAT ACTTACCTAA CTCATACATC GCTAGTAAGA 18 60 

ATAATAGTCT CAAAATTTCT TTAACCACCA CTAAACACCC CATGTTAATT TATCGATAAT 192 0 

TTGTATAGCT TGTTTTAATG CGTCTCTTTT TTCTTTGATA TCTCTATTAT CGCCATCTTC 198 0 

ATCAGCTGAC ATTAA.CTCAC TGTCATATTC ATATAATAGT TCTGATATTT CATTACTAGC 204 0 

TACTACTAAT AAGTTTTCAT CTACATCAAT CGTTACCGTT TTCTTTGGCA TCTCCATCTC 2100 

TCCTTATCTT AACTTGTGCC TCGTATTTGC GCTCAGCTTC TTCTTTACTC TCTGCCTCAA 2160 

C AAC T G T AAA CGTCTGATTA TCTCTAGCAG TAGTAAAATGTTCATGTGGT TGTCCTGTTG 2220 

AATCTTTGAA TGTTGTGACT AAGTATTGCG TCACTTCTTA TCACTCCTTT GAATGATTCT 2280 

AAGTTTTTCT ACGAATAAAA GTATTAGTAC AACACTCAAT GTAGCCAACA TATTTTTTTG 234 0 

CTTTGCAAAA TCTACTATAA CGATTAAGAC TAATAACATT CCAATTCTGC ATGTAAATAA 2400

ATCTAATTCT TTGTACAAAA CCATATATCT GTTAAGTAAA TTGTTAAATA TTACTATGAA 24 60 

TACAAGTATT GGAACTAATG TAATGATGTA ACTCACTTCC CCAAAACCTC CTTGACTCGA 2 52 0 

TCTAAGATGT CTTTACACTC CGCTACTTCC GAAGCCTTTT TCTCCACGTT CTGAAACACT 2580 

TTCGAATTCC TCCACTTGCT TTAGTTCAGG TGTCCATATA GGCACGATAA CCAATTGAGC 2 64 0 

TAGTTTGTCT CCTTCGTTGA TTTGATAAGT TCCGTATTGT CTTATGGCGT CACTCAAATC 2700 

GATTTCTCCT TTAATATCAA AAACACCTGG TGTGATATAA CCATTCGATG CAATAGCGTC 27 60 

ATTCTTGATA TTAATCCCTA AATTGCCGTG ATATCCCGCG TCTATCTGC CTGTTTCAAT 2820 

CACTAAATGC GTTTTACTAC TTACACCACT ACGGCTAGTT AATAGTCCGA CATAGCCCTC 2880 

TGGTATGCTT ACAGCTACAT CTGTTTTAAT CACTGCCTTT TCTTGTGGCT CAAGTACGAC 2 94 0 

AGTTTCAGCT GAGAATATGT CATAACCTGC ATCCGTCTTA TGATTTCGTT CGGGCATTCT 3000 

AGCATTTTCT GATAATAGCC TTACTTGTAA TGTGTTAGTC ATTTTCCTGC TCCTCCCTAG 3060 

CTGTAGCAAA CGCTATTCTC AATTTCAATC TTTCAACAAT ATGAATTAGT GCGGTATTGA 3120

GGAATATTTC AAATTCTTCA ATGTTCTCAT CTATAAAATC AAGTATTTCT TCCTCTTGTT 3180

CACTGTCAAA CTCGCTTAGT ACATCCCAiB. TATTTATGTC GCTTTTGCTC GTTTCTAATA 32 4 0 

CTCTTTTGAT TATTTCTGAA TTACTTTTAT TACTCATTTT CCTTGTTCCT CCTCATATTT 3300 

ATAGACAACT TGACCTGCCA TAATCCCTAC TGCTTCATCA AGTTCAATAC CTTCTTTAAC 3360 

TGAATGTTGA ATAGCATTTG TCATTCCCTC AAGTATTTCA TCAAACGCTT GTGTCTCTT 34 2 0 

ATACACGTCC TCAATCTCTT TTAGTAATCC CTCTGTGTCA TTACCGTTAT ACGCACTAGC 3480 

ACTGATCACT GATTGTTCAA TTTGTTCGCG GTTATTCATC ATTTCCATCT CCTCTAAAAT 354 0 

AAAGTTAGTT GCTTCTGCTC CTCGTATTCC AAACCATGTT GCTTTATATA TGTTTCGAGC 3600 

TCTTCCGCTG TATCAAATGT CTTTTTCACG CCTTGCCAAC CTGGCACGAT ATGCCCATGa 3660 

AAGTAATAAG TGCCGTTCAC TACATGGATA TGTGCCACTC GTTCGTTATC CTGATACAGA 3720 

TATCTCTTAG ATCCGAAAAA TTGGTTTAAG TATTCTTTAC ATGCGCTATC GGTTTTAGGC 3780 

ATTTATGCTT CCTGCCATTT CTTAAACATT TGGTTffAAG TAGTATCAAA CCAGTACGGA 384 0 

TCACGTGAAT GTTTTTGAGG CACATTAAAC AAATGTGGCT TCTTCTTACG TAGTTCAGCC 3900 

TCTTTACGTC GTTGCCTAGC CATTTCACGC TCTTTGCTCT CTCGCTCCAT GATTTTGGAT 3960 

AACACAATTT CTTTATACTC AGCTAAGCGC ATACCATAAG GTGCATGTAA GGCTTCTAAC 4 02 0 

AACGCCCAGC CACCTCGTAC TCTTTTTGCA ACCATTCCTG GAGTTAAACC GTTCTTTTTT 4080 

ATCAATTCAT TTTCATGTTC GGTAAATTTA TATGGTTTAc CGTTAATCTT TACGATACTC 414 0 

ATTTATTCCA CCTCTATACA TTTACTTTTT TTAATCCAAT CCTCTAATTT GTGCGTGTTG 4200 

TGATTTCTAG TAAATAGTTC AC TT AC ATT A ACACCTAGAG CATCTGCCAA TTTATCTAAT 4 2 60 

ACATTTAAGT TAACCATCTC AGCTTTTCCG TTTTTATATC CACTAATAGT TGATCTTGAT 4 320 

ACGCCAGTTT CATTGTGCAA ATCTTGGACA CTTACGTTAT CTCTAGCCAT GATTACCCTT 4380 

AAATTAGTTG CGAATACTtC GTTCAACTTC ATTTATTCCA CCCTATATA TGCATGTCTT 4440 

ATTGTTATGT TGTCATACTT TAGTAATTCG TCCGGATTGT CATCTAAGCG CTTTGCCAGC 4 500 

GTATCTTTTT CTTTATCCAC ATCATCGTAA TGCTGATATT CAACTTCTGT AGGTATTCTT 4 5 60 

ATATCAATCG TTGCGTTTAT ATATGCTTGT TGTTGCATTA GATCACTTCA TTTCTCTTTT 4 62 0 

TCTTTTACGT CTGACTTTCA CTAAGTCCTC ATATACCATC CATTCTTGAC CTGTGTATTT 4 680 

AGGCGCTTTA CATATCCACG TTAAATTCAC ATCTCTATAC TGATATCTGA ATATCTTCGC 4 74 0 

TTTGATGTTG GCAACTTCAG TCGCCTTACC TTTAACGTCT ATAACTTCAA CCAGTTTCCC 4 800 

TTCCTTCCAC AAAGAGAAAT CGGCTATATA CGTAATCGGT CTTTGTTTCC CGAATTTAGG 4 8 60 

TTGTAATTCA AATTTCGGTT GTATTTCGAT ACGATCATAG TTAGTGCCAT TCATATTACT 4920

TTCTAAATAT TGGTAATATT CGCACTCTAC TTTGCTATCA AATACAATTC CTTTGTACTC 4980

AACTTTCTTA GCATTGTATT TACTCATTGT GCCACCTCTA AATATCAAATATCGTTGCTT 504 0 

GCAATCCTAG CTCTTGCTCA TATAGAAGCC CGTGAGCGCC TTTGAATCGT TTTAGGTCAC 5100 

TATCAGTCAT AATTTTCTTT TCGTCGCTGA AATGGGCTCC TGTGAGCGAA TAAACTTCAT 5160 

TTACGTTGTC TTTATACTTG AT G AC C T T AA TATCTTCCGT GCCATCTTCT CGGTATAAGT 5220 

AATATTTTTC TTTCGGCATT TTTTAACACT CCTTAATGTG TGTTTTCTTC CAGTTGATTT 5280 

CATTCATGAT TTTCTTTTCA ACTCTGTCGT AATCATCGAA AGGCGATAAC TCGTTATTGT 534 0 

CCAACAATCT ATTGACCGCC CAACCAGTCT CGATATATAC ATTTGCTACA ATCGGGTCGC 54 00 

TTTGCTTTGT CTCTTCATAC ATCGATTTCA A'AAGCTTTT GAATTGCATT ATATTCATGT 54 60 

GAAAAACCTC TGAGTCTTCT TGTAATACTC AAATTCAATT ATTCCGGTTT CGCCGTCTTT 5520 

GTTTTTGGCT ATGTTACATT CAACAATAGA TTTGCCAGTG ATACTGTCAT CTTCGTCACG 558 0 

GTTATAATAA TCATCACGGT AAAG TAG CAT CGCTAAACTC GCATCTGCTT CTATTCCGC 564 0 

TGATTCTTTC ATGTCCGATA GCATTGGTCT TTTATCCTGT CTAGACTCGA CACCACGATT 57 00 

CAGTTGTGAA AG TAG T AC G A TGATTGCGCC TGTCTCGTTA GCGATTATCT TTAAGTCACG 57 60 

TGATATCTTT TCTACTGCTA CACGTCTATC AACTTTCGCA TCAGTATCCA TCAGTTGAAG 5820 

ATAATCTATA AAAATAACTT GTTGCCTGTC TGAATGCCTC ATTGtTGCGC TCGCACATCT 5880 

TGCGGTGTGA TATTACTTTT ATCAGAAATA TCGATGCCTA ATTTCATGAT TTTATCCATC 5 94 0 

GCATTCGTTA ACTTTGTTAA GTCATCCGGC GTTAAGTTCC TGATTTCTTT TATCTTTGTT 6000 

AACTCAATAC CAGTAATTGT T GAT AAC AT A CGTTTCAATACCGATGTGCC GGTTGTTTCG 60 60 

AGACTAAAGA AAGATGTTTT GTATCCATTT TGTGCTATGT TCAGCATCAT GTTTAATGCA 6120 

AAACCTGTCT TACCCACTGA GGGACGCGCT GCGATGACGA TTAATTGTGA TGGTTCTAAT 6180 

CCCCCTATTT TGTAATCCAT TAGCTTGTAA CCCGTCTTAA TTTGCTTCTT AGGGCTATCG 6240

CTGTATAACT CTTCGACAAA CTCCTCAACA AACTTCTTGG TTCCATCTTC TTTTTTGTTA 6300 

GTAATTGTTT TTAAATCCTT GAGTTCATCA ATCAAGTTGT TAAAGTTTTG GTTCGTAGGT 6360 

TGTTGTTTGA ACTCAGTTAC CAATTCGTTA GCTTTGTTGA GCTGATAACT TTCCAATAAT 64 2 0 

TCTTGTTGAT AACGTTCAAA GAAGCCATAT CCAATGAAAT CGGAGTTGTA AAGTTTAGTT 64 80 

ATAGTATCTG CATCTAAAAA TTCTTTATCT TTAGTTGCTT TTAAATAGAT TTCTTGATGA 654 0 

TCTATCTTTC CGACGTCCAT TACATAATTG AAAAAGGTTT TAAACTTTTC GTTCGTAAAC 6600 

ATGTAATCTT TAACTCTTAT CTTTTCTAAT ACGTCCGGTT GTTTAACBAG CGTAGCGATT 6660 

ATTGTACTTT CAATTTCGAA TTGTCCGTAA TTCATTCGTT TTCGCCCCCA AATTCTGCCA 6720 

ACTTATTCAT GAACTTATCT AGCGCTATTT TTCTTTGTCT GACATATTCG GGGTCATTCT 6780

GCATTTTCCA TTGGTGTGTA GCGGTTTCGT TATCTACTGG CTCGATAGAT ACTTTTTTAG 6840

GTTCCTTACG CATGATTGCT GGTAAGTTAG GCGGGTACGG GTTGTTACTG TTGATATAAA 6900

CATCTACCGC TTTTACAGTT GGTTGATAAT CTCCATTTTG ACTTAATACA TCAATCCACA 6960 

TTTCTAACTT CGGTTTATCA AAATCAATGT TGTATACGTA CCTAACTTTT TTAATAATTT 7020 

CTAATGCTTG TGTTTTGCTC ATCGGCATTA GTCATCACTC AATTCTTTTT CCATTTGTGC 7 08 0 

AATGACATCA TCAGTAGTAT TTTTTCTAGG TGCTATTTTA TTTTCTGCAT CTTCTTTTGT 714 0 

TTTGACATTC TCTTTAGCCC AGTTGTTTAA AACTTTAATT AAATAGCCAC CATGCGCACT 7200 

TTTGCTTTTA GTGTACTCAA CACCTACTTT TACAACTTCA AAAGCGTTTG TACCATATC 72 60 

ATCAATAGCA AACCCTAATT GTTCCATTTG ATTAGGTGTT AACTTATCAT CCAAATTTGC 7320 

AATTATATAT TTTATTGAAG AT G AG AAG AC GGCTTCTCTT TCTTCTTCTT TATTCTTATA 7 38 0 

TTCTTCTTCT TTTTCTTCTT CTCTTTCTTC TTCTTCTTCT GTATCGTTAC GTAACGTTAC 7440 

GGTAACGTTA CGTTTTGCTT CTAGTAACTT TTTCTGTTTC TCACGATAGC GTTGTTGTCG 7 500 

CAATTTATTT TTTTCTTTAT GCTTAGCTTT GCTATCTAAG CTTTGATGCT TCTCCCAGTT 7560 

TGTCACTTTT ATGACACCAT TAACTTTTTC AATCATGCCC AATGTCTCAA AAGTTTGAAT 7 620 

TGCTAACCTT ATTGAGTTAA TAGGTCTATT AAATTCATTT GCTAACATTT CTTCGTTGTA 7 680 

CGGCAAGTTT TCGGATAGCA TAATATAACC TTGTTCATTG TACTTTCCTG ATAAAGTTAG 774 0 

TAACTTAACC CAAATAGTTA TGATCGTATC TCTTTCGGGT AAAGCTTCGA TATATTTGAT 7800 

TTTGCTGTCA TCAAACATGC CAACTTTAAG TTTTATCCAC GATACTTCTC CCATTGTTTT 7 8 60 

CTCCTTTCAG CATTTTGTTG AGCCTCTCAT CAACTTTTAT CCACGAGTCA TGCAAGTGAT 7 92 0 

ATTTATCATC AAACGACTTA ACGCCAATTG CGTGCTGTTC ATTATGATGT TGTCTACACA 7 980 

GTGCTAACAC ATGTTTGTCG TAG T GAT T C A TTTTGTTTCT GTTCATGCCT CTGCCGACTG 804 0 

CTTCATAATG TGCCAGGTCT GCGTGAGGCT TTCCGCATAT TACACAGTTG CGGTTGATTG 8100 

TAGCCCAATA TAATAACGCT TTATCTTCGC TTAACAACTT ACTCGTTTCT ACACTCATAG 8160 

GTATTTGATG AT G AAAC AT A AACGCTATAA TCAGTTCTAT TAACTCCCTT GCAACTTTCA 8220 

TAGAACAGTC GCGCAGACTG ATTTCTTCAT AACCTTTCAT AATTCCAAT TCTGTTTGTA 8280 

ATAATTTTCT AGTTGATTCT ACTGGTTCGC CCCAGTGAAG TTCTATATCT CTACACATTG 834 0 

CGAATATTTT TTTGCGTTGT TCTATAGATA GTTTTTTATT GTCCGGAACC TCTACTTCTG 8 4 00 

CTTTTAGTGG ATATCCGTTT TCTAGTAAGT CAATGTGACT TTGTTCAAGT TCAACACCAG 8 4 60 

TAGCAACGAC GGAATAAGTA CCGTCATTGT CTTTCTGGTA TCTTGTAATG TATTGCATTT 8520 

AAACCACGTC CTAGAACGGT AAATCATCAT CATTGATTTC TATTGGACCA TTAGCATTAG 8580

CGAATGGGTT TGATTGTTGA CTCATTGGCG TCTGTTTCCC ATTTGCTTGC TGTTCTTTTT 8640

GTTTCATCTC ATCAGTTTTA GGTTCTGGTT TAT T AAC T AC TTCATCGTCT TTATTCCAAA 8700 

CTTTTACATA TGAGAGTCTT ACAAAATACT TGCCTTGTTC CTCGTTAAAT TTATTTTTAA 8 7 60 

GTACAATAGT TCCGATTTTG TTAATTAATT GATCTGTGTC AAAAGTTAAA TCTGGTAAGT 8820 

TCAATTTAAT TCCTAATCTA CTAAGTAACT CGATATATTG TTTTTCTTGATAATCTTGTT 88 80 

GGAATGGTGG GACGAATTGG TTGTGTTTGT ATTGTTTACC TTCGTTGTTT TCAAAAACAA 8 94 0 

TCGTGAAGTA TCTGTTTTCT CTGTCGTTAA ACTCGACATT TGCAACTTTT ACTGTAAATT 9000 

CTCCAGCTCC TAAAAAGTCC CCACCTTTCA TGAATGCCTC TTGATTAGTT TCTTGAATGT 9060 

ATTGTGTTCT ACCAGTGATT TTCATAATTT TTATACCGTC CTTTTAATTA ATTTTTAATT 9120 

ACCATTTCTA ATTGCTTGTA CAACATCGTT AATACTTGGA TTAATGAAAC GTTTGTTGTT 9180 

AATTTTGATG TTGCTTGAGT GTCTTATCTT TGTCTCGAAT AAATTTGATG GTTCAGCGTT 92 4 0 

AAGTACATAT TGATAAGTTT TTTCGCCGTC T'ECTCATGT TCTTCTATTG TCATTCTTGC 9300 

TAACACGTCA GATTGACTGA TGACTGCTTT TTTTATTTGG TCTTGTGCCT CTATCGTGAT 9360 

TGTTGGATTG ATAGTACTTC CCTCATCATC TTTGTCTTTG TTAATGCCCT CGTGTCCGCT 9420 

TATAGCAAGA TGAAATTGAT AATGTTCTTG TAATTTAGAA ATATAACGAT AAATACTSC 94 80 

AATGCGTGTA GCACACTCGC CCCAATCATT AAATGTCGGT TTCTTTGATT TACCGTCCAT 954 0 

GATGTCGTCC ATAGTGATAT CACGTAACTT TTGGATTGTT TCAATCACTA CAACATCAAT 9600 

TTGTTTTCCG TTTTCTCTTA GTTGTTCAAT AATTTTAGGC AGCATTTTAA TCACTGCACT 9660 

AAAATGCTTA TAATTCTTAA TCTGCACAAC TGCCCCATCT TCTGTTACCG TTGTTCCGTC 972 0 

CTCATTTATA TCTAGTACTA AGGCATTGTT ATCTTTTGTT AAAAACGTAG TTTTACCAGT 97 80 

ACCGAACTTG CCGTATATCG CAAATTTATA AAACTTGTTT GCATTTTGTT TGCTGATGTC 9840 

TTTTACACCT AGTTGCGTTA AAATATCGAC ATCTTGATTAGTTTTTTCAG TCATCTATTC 9900 

TCCCACCTTT ACCGTGTATG ACGTTGGTTT CTCCACAATG CTAGCACCCT CTAAAACTTC 9 9 60 

GCCGTTTGCG TCAATCAATG TGCCGTTTTC AG T T AC AT T G AAATCTTTCT TAATGTCTGA 10020 

TTGGCTAAGT TTTTTAGTTA CTTTTACATA GTTGTCAAAA CCTCGTTGCT CAAGTTGTnT 10080 

AATGACTTCT TGCTCATTGC TAACTTGAAT GACTTTTGAA CCTTTTCTGG CTGTCACTTT 10140 

TCCGTAAGtG TATTCAACTT GAATTTGCTA TCTTGTTCTT TTTGTATTCT GTAATATTCA 10200 

ATTACAAGGC TTTGTAAATA TTCTTTGCCA CTCTGTAATT TTTCTACTTC TTTATCTTTC 10260 

CATTCGTTTA TGCGTTCAAT TTCTTTATTT GCTAAATCGT TGATTTCATT CTCTTTAGTT 10320 

GTGATTGCAT CCAGTTTCTn AAAAACCCAG TTAGCACTGT CTAGATCAGT nACTTTGAAT 10380 

CGGTCGTCTT GTTCGAATGT n 10401 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

TTTCTCTCTA TTATTCTCGA TGCGTAGATA ATTGTTTAAA TTAAGTTTA TAGTAATGTT 60 

GAGTTTATAA TTTCATATAT C T AAAAAC AG GTGTTGTATA TATAATCATT CATCTAGTTA 120 

TACTTACTTT AAAAATAATA TAATTTCATG CGATGCAATT CATTGATGGA TGTTTTTAAT 180 

CTTAATCAAA TCCAaATAAA GCATATATTT TTAAATTCAC TTTCTTTCGA ATCGATTTTT 24 0 

ATCTCTTGnA TTAAACTTTT CCATTGTTTC ATTAAAGCTC TCTGTCATAT CTATTCCCAT 300 

TGAATTCGCT AAACATAACA ACACAAATAA ATTATCACCT AATTCTGCTT TAATCGTATT 360 

TGCTTCCTCT GAATCTTTCT TCTTTTTTTC ACCATAGGTA TGATTTATTT CACGTGCAAG 4 20 

TTCGCCCACT TCTTCAGTCA ATCTAGCTAA GTTAGCTAAT GGTGAAAAAT ATCCTGTTTT 480 

AAATTGTCCA ATATATTCAT CAACTTCACG TTGCATTTCT ACCATTGATT TCATTTCTAC 54 0 

GTTCTCCTTA TAT T GC AT T T CTAATATAGT ATATATCAAT TTGAAGTCTC ATGCATGTTT 600 

AGTGCAATCT ACTGACTCTA GC T AAAC TAG ATGACAAGTT GTTTATTTTAGAAGTTTAAT 660 

AATTCAGTTT ATATAAATGT AATGCATTCC TAACTAAATT AAATCAATTG AAATTGGGAT 720 

TATAACTTTA TGATACGTAC CACTACAATA AAATAATATA GTGAATAATC TACCATTAGA 7 80 

AAAATAAGCA CAAAAAAACT AGCAACCACA CAAAAATGTG ATTAGCTAGT TAATAAGTGT 840 

CTAATTTAAG TTAATTGTTA ATCTATAAGA TTAATCACTT GAACGCGCAA TCAAAATAAT 900 

ACGTACAAGC TCTGCTACAG CGACTGCAGT TGCTGCAACA TAAGTCATTG CTGCTGCAGA 9 60 

TAATACTTTA CGCGCATGCT TGTATTCTTT TTCATTTACA ATGTTCAATG CCGTAATTTG 1020 

TTTCATCGCT CTTGAACTCG CAT C AAAC T C mCTGGTAAC GTAACAATTG AGAATAATAC 108 0 

CGCTAATGAC AT T AAACC AG CACCAATCCA TAAAGCAGTT GAACCaAATG CACTACCTAT 1140 

CGCTGTTAAG ATAATACCTA ACATGATGAT CATATAACTT AATGAACTCC CTAGGTTTGC 12 00 

AACAGGTACT AATGCTGCTC TGAATCTTAA GAACCAATAT CCTTGGTGAT CTTGAAGGC 12 60 

ATGACCAACT TCGTGGGCTG CAATTGCAGT TCCAGCAACT GATGGTCTGT CATAGTTTGC 1320 

AGGAGATAGT GAAACAACTT TCTTTTTAGG ATCGTAATGA TCTGTTAAGA ATCCTTCACC 13 8 0 

TTTAACAACT TCGACATCAT AAATACCGTT TGCATGTAAA ATTTCTAATG CAACTTCACG 14 4 0 
ACCCGTTTTA CCACTAGTTG ATCTAACTTG TGAATATTTC TCATAGTTAG ATTTAACTTT 1500 

GTGTTGTGCC CATAAAGGAA GCACCATTAA T AT T AC G AAA TAAATTATCA TAGTAAAAAT 15 60 

TGAAGACAAT AAACTCACTC TCCTTTATAA ATATTTTACT GTCATTTGCC GTTTTTATCA 1620 

AAT CAT T T AC ACTTTAATAA TTTGTTTAAT TCAATATAA GCAAAAGTCC AAAAACACTT 168 0 

AGACAACATG ATAATACACC AATTTGCCAC ACATGTGTAG TTATAAAATC ATAATATGGA 1740 

AATTGAAGGT GAAAATAGTC AATATAATCA TTCAAAAACA CCCAAATCAT yGCTACACTG 1800 

ATTCCAATCA TAGAACGTTT AAACCTAGGA T AG AAG T AAA TTGCCTGAAC AGCCATTATA 18 60 

CTGTGGGAAA ACATTAATAC CAAACCATTT ACTGTAATAT CACCTTGTTC AATAATAAAT 1920 

AATATATTCA TTATAACTGC CCAAATCCCA TATTTGAATA ATGTTACAAA TGCCAGTGCA 198 0 

TCGATAATAC TATTTTGTTT TTGAATTAAT AT C AAT GAGA TAGAAATAAC T AAG TAT AAT 2040 

ATTGCAGTTG GGCTATCTGG AACAAAAATC TTAAAATGCC AGGGCGTATG ACTTAATTGT 2100 

TCACCATACC ATATATAACC ATAAATCATC CCTAATATAT TACAAATGAG TAGCATCATT 2160 

AACCAAGAAC GTTGATAAAG TGTATATTGC CAAAATGCTT TAATTGTCAT CTGCTAAGTC 2220 

CTCAAATTGA TTATGTTTAT TTACTAGCTT GAGTGTATTT AAAATTGCG TTAGTTGATA 2280 

AAAACGTTGC TTTTCATTCA TCTGTAAACT TAAATCAATA TTGTGTAACA AGTAATCTAT 234 0 

TAATAACGCA TGTTTATGCC GAT C T AT AG C CATACTATTT AAGTCAT GAA GATAAGTTTG 24 00 

ATAACTGGGC GATTCCGTTA GTTGAGCAAC TATTAAATCA TCTAGCCTTT GCTCACGCTT 24 60 

TGACACGTTT GCGAAGTGAA TTTGAATATC AAAAGCACAG TTATGATTAG CGATATAATC 2520 

AAATATTTCA TTTGTATTCA TTAACTTTAT ATTACGCTTA GTAAATTGAA TTGCAGAAGC 2580 

GTGACTTCCC ACTTCTGCAA TTTCTAATGT T T CAT GAT GA TTAATTTTTG TATCTACAAA 2 64 0 

ATGAATGTTT GCCAATTTCG CCTCATTG^C TTTTATATAG TTAAGCACCC AAACTGCAAT 27 00 

ACGCGACTTA AATCGATATT GAAAAAGTAA ATATTCAATA AAACTTTCTT TAATTTGATT 2760 

GAGTGTCTCT G AC AT C AAAT ACCCCATTTT AAGATTGCAA TCTTGaTAAT TCGTCATGCC 2820 

AATTTTCGTT ACTTGGcTCT AGTTCCAACA ATT GAT T T AA AATAGTAATT GCTGTTCCT 28 80 

TTTGACCAAT TTCAATTAAA TAGAAATAAT AATCACTCAT AAAAT C AAT A TTTGTTTTCA 2940 

TCGTTGGATA TGCTAATTCA AAGAAATGTT GAGCTTCTTT ATCTCGCTC 2989 
(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

CATCAACTCC TTAATTACAC TGTAAATGAT ATGCGTCTTT TTGACAACTA TATTTGTCAA 60 

ATCTACACCA AAAAAT AT G A TTATCCACCT ATGTATGACA TTTTGAAACA AACACCTCAA 120 

CGCCTACAAG TCATAATTGT TTACTTTCGT TACACCTTCC TGCATAATTA ACAGCATTCT 180 

AATTTTAGTA TGATGCACGC ATTTTCACTA AATCAAACCA TTCAAAGGAG ACTATTATGG 240 

CATTTACATT ATCTGCAATT CAACAAGCAC ATCAACAATT TACTGGTGTT GACTTTCCAA 300 

AACTATTCAA AGCTTTTAAA GATATGGGGA TGACTTACAA TATCGTCAAC ATTCAAGATG 3 60 

GCACTGCAAC ATACGTACAT CAATCAGAAG ATGATATCGT TACGTCATCT GTAAAAAGTA 4 20 

ATCATCCTGT TGCTCAAAAA TCAAACAAAA CAATAGTTCA AGACGTCTTA ACTAGSATC 480 

AACAAGGGCA AACAGATTTT GAAACATTTT GTGATGAAAT GGCTGAAGCT GGCATTTATA 540 

AATGGCATAT CGATATTCmA GCGGGCACTT GTACTTATAT CGACTTGCAA GACCAAGCTG 600 

TTATTTCAGA ATTAATCCCT CAATAAACTA TATTTATAGC AACATTTTAA TTATTTCATA 6 60 

AAATTTTATT GATAATCATT ATCGTTCGGT ATAAAGTAAA TACTATATAC TACTTATGAG 7 20 

TGAGGTTGAT TATCATGATA ACTAACACTT TTATTTTAGG CATCACAGGC CCAACAAGTC 7 80 

TTGTCGTCAT TAGCATTATC GCTTTAATTA TTTTTGGTCC GAAAAAATTA CCACAATTTG 84 0 

GCCGTGCCAT CGGTTCTACT TTAAAAGAAT TTAAATCEC AACAGAAGAT TTAGATAAAG 900 

AGTCTCACGA TACACCCAGT AAGGAATCGA AACAACAGCG AGAGCAATAG CACTGACCAC 9 60 

ACCTTACTGG TTCACTTTAG CGAACTACGC CATCGGTTAG TAAAAATTTT ATTGTCGTTC 1020 

GTCATTACGG TCATCGTCGT ATATGTyTCA TCATTTTGGT GGATGACACC ATTCATAACG 108 0 

TATATyACCC GgCACATGTG TcCTTACATG CATTTcATTC ACAGAAATGA TACAAATAAC 1140 

GTG 1143 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

CAACGCCTGA ACGTAAACCA TATCGTTTCG CGATTTCCTC ATCTTGACTA TTTACTAAAA 60 

ACTCTCTCAT GGCGATTAAT GTTTCTTTTT CTTCTTTAGT TAATGGTAAT TCTAACTCAG 120 

CTGCTTTTTG ACGCAAAGTT GGATGACCAT CTCTAATGAT GTCTTTCATT GTTAACATAT 180 
ATTGCACCTT CCTTATTTTA ATTTGTTTTA GTTGAATGAC AGTAAAAAGG TTGTTAAGAT 240 

ACTCATACAT TTTTATGTGT AAATATCTAC AAAGTTAACC AACTACTGCC AATGTTTATT 300 

TTAGATAGTA TATGTAAATT TTCAaGAtAT GCgTAATTGC gTTAAAAAAT GaTTAAAGTG 360 

TTGGTTTCAA GCAATGaTAC TTTAGAAATT TATTTATCAT CTTGACTTTA AAAATTATAT 420 

TATAAATGAC GTAACTGTCA ACAGATATAC TTAGTArTGA AGATGTGTAA TGTAATTGTT 480 

TAAAATTGAT TTCCAAGCAG ATTTTATTTA TCATTTAATT TAAATAGCAA GTGGAGGTAC 54 0 

AAGTAATGAA ATTTGGAAAA ACAATCGCAG TAGTATTAGC ATCTAGTGTC TTGCTTGCAG 600 

GAT G TACT AC GGATAAAAAA GAAATTAAGG CATATTTAAA GCAAGTGGAT AAAATTAAAG 660 

ATGATGAAGA ACCAATTAAA ACTGTTGGTA AGAAAATTGC TGAATTAGAT GAGAAAAAGA 720 

AAAAATTAAC TGAAGATGTC AATAGTAAAG ATACAGCAGT TCGCGGTAAA GCAGTAAAGG 780 

ATTTAATTAA AAATGCCGAT GATCGTCTAA AGGAATTTGA AAAAGAAGAA GACGCAATTA 840 

AGAAGTCTGA ACAAGACTTT AAGAAAGCAA AAAGTCACGT TGATAACffiT GATAATGATG 900 

TTAAACGTAA AGAAGTAAAA CAATTAGATG ATGTATTAAA AGAAAAATAT AAG T T AC AC A 960 

GTGATTACGC G AAAG C AT a T AAAAAGGCTG TAAACTCAGA G AAAAC AT T A TTTAAATATT 102 0 

TAAATCAAAA TGACGCGACA CAACAAGGTG TTAACGAAAA ATCAwAAGCA ATAGAACAGA 108 0 

ACTATAAAAA GTTAAAAGAA GTATCAGATA AGTATACAAA AGTACTAAAT AAGGTTGGTA 114 0 

AAGAAAAGCA AGACGTTGAT CAATTTAAAT AATTAATATA AT ACAGAT GG TAGGAAACAA 1200 

CTAATACAGT TCCTATTATC TGTATCTTTT TTTATTAAAA CAGAACTTTT TCAAATGGTT 12 60 

TAACAGTCCC ATTTATTTGT GGTACAATTA GTAAGGATAA AATGAATTTC TATACAATTA 1320 

TGGGAAAGGT ATTGTGAATT GAATGGCTCC TAAGTTACAA GCCCAATTCG ATGCAGTAAA 138 0 

AGTTTTAAAT GATACTCAAT CGAAATTTGA AATGGTTCAA ATTTTGGATG AGAATGGTAA 14 4 0 

CGTCGTAAAT GAAGACTTAG TACCTGATCT TACGGATGAA CAATTAGTGG AATTSTGGA 1500 

AAGAATGGTA TGGACTCGTA TCCTTGATCA ACGTTCTATC TCATTAAACA GACAAGGACG 15 60 

TTTAGGTTTC TAT G C AC C AA CTGCTGGTCA AG AAG CAT C A CAATTAGCGT CACAATACGC 162 0 

TTTAGAAAAA G AAGAT T AC A TTTTACCGGG ATACAGAGAT GTTCCTCAAA TTATTTGGCA 168 0 

TGGTTTACCA TTAACTGAAG CTTTCTTATT CTCAAGAGGT CACTTCAAAG GAAATCAATT 17 4 0 

CCCTGAAGGC GTTAATGCAT TAAGCCCACA AATTATTATC GGTGCACAAT ACATTCAAGC 1800 

TGCTGGTGTT GCATTTGCAC TTAAAAAACG TGGTAAAAAT GCAGTTGCAA TCACTTACAC 18 60 

TGGTGACGGT GGTTCTTCAC AAGGTGATTT CTACGASGT ATTAACTTTG CAGCAGCTTA 1920 

TAAAGCACCT GCAATTTTCG TTATTCAAAA CAATAACTAT GCAATTTCAA CACCAAGAAG 198 0 

CAAGCAAACT GCTGCTGAAA CATTAGCTCA AAAAGCAATT GCTGTAGGTA TTCCTGGTAT 204 0 
CCAAGTTGAT GGTATGGATG CGTTAgcTGT nATATCAAGC AACTAAAGAA GCACGTGACC 2100 

GCGCAgTTGC AGGTGAAGGT CCAACATTAA TTGAAACTAT GACATATCGT TATGGTCCTC 2160 

ATACAATGGC TGGTGACGAT CCAACTCGTT ACAGAACTTC AG AC G AAG AT GCTGAATGGG 2 22 0 

AGAAAAAAGA CCCATTAGTA CGTTTCCGTA AATTCCTTGA AAACAAAGGT TTATGGAATG 228 0 

AAGACAAAGA AAATGAAGTT ATTGAACGTG CAAAAGCTGA TAT T AAAGC A GCAATTAAAG 234 0 

AGGCTGATAA CACTGAAAAA CAAACTGTTA CTTCTCTAAT GGAAATTATG TATGAAGATA 24 00 

TGCCTCAAAA CTTAGCAGAA CAATATGAAA TTTACAAAGA GAAGGAGTCG AAGTAAGCCA 24 60 

TGGCACAAAT GACAATGGTT CAAGCGATTA ATGATGCGCT TAAACTGAA CTTAAAAATG 2520 

ACCAAGATGT TTTAATTTTT GGTGAAGACG TTGGTGTTAA CGGCGGTGTT TTCCGTGTTA 2 58 0 

CTGAAGGACT ACAAAAAGAA TTTGGTGAAG ATAGAGTATT CGATACACCT TTAGCTGAAT 2 64 0 

CAGGTATTGG TGGTTTAGCG ATGGGTCTTG CAGTTGAAGG ATTCCGTCCG GTTATGGAAG 2700 

TACAATTCTT AGGTTTCGTA T T C G AAG TAT TTGATGCGAT TGCTGGACAA ATTGCACGTA 2760 

CTCGTTTCCG TTCAGGCGGT ACTAAAACTG CACCTGTAAC AATTCGTAGC CCATTTGGTG 2820 

GTGGCGTACA CACACCAGAA TTACACGCAG ATAACTTAGA AGGTATTTTA GCTCAATCTC 2 88 0 

CAGGTCTAAA GGTTGTTATT CCTTCA5GCC CATACGATGC GAAAGGTTTA TTAATTTCTT 2 94 0 

CT ATT AG AAG TAATGACCCA GTCGTATACT TAGAGCATAT GAAATTGTAT CGTTCATTCC 3 00 0 

GTGAAGAAGT AC C T G AAG AA GAATATACAA TTGACATTGG TAAGGCTAAT GTGAAAAAAG 30 60 

AAGGTAATGA CATTTCAATC ATCACATACG GTGCAATGGT TCAAGAATCA 21GAAAGCTG 312 0 

CAGAAGAACT TGAAAAAGAT GGTTATTCTG TTGAAGTAAT TGACTTACGT ACTGTTCAAC 318 0 

CAATCGATGT TGACACAATT GTAGCTTCAG TTGAAAAAAC TGGTCGTGCA GTTGTAGTTC 324 0 

AAGAAGCACA ACGTCAAGCT GGTGTTGGTG CAGCAGTTGT AGCTGAATTA AGTGAACGTG 3300 

CAATCCTTTC ATTAGAAGCA CCTATTGGAA GAGTTGCAGC AG C AG AT AC A ATTTATCCAT 3360 

TCACTCAAGC TGAAAATGTT TGGTTACCAA AC AAAAAT G A CATCATCGAA AAAGCAAAAG 342 0 

AAACTTTAGA ATTTTAATAC ATTTTAAAAG TTAACGAAGT TAGCGTATTT TAGTCTCATT 34 8 0 

GATTAAAATG AAATGTTTAA TTTACGAAAT CTTi.GGAGGG CAAAAACGTG GCATTTGAAT 354 0 

TTAGATTACC CGATATCGGG GAAGGTATCC ACGAAGGTGA AATTGTAAAA TGGTTTGTTA 3 600 

AAGCTGGAGA TACTATTGAA G AAG AC GAT G TTTTAGCTGA GGTACAAAAC GATAAATCAG 3 660 

TAG T AG AAAT CCCATCACCA GCATCTGGTA C T G TAG AAG A AGTTATGGTA GAAGAAGOT 37 2 0 

CAGTAGCTGT AGTTGGTGAC GTTATTGTTA AAATCGATGC ACCTGATGCA GAAGATATGC 37 8 0 

AATTTAAAGG TCATGATGAT GATTCATCAT C T AAAGAAG A ACCTGCGAAA GAGGAAGCGC 384 0 
CAgcAGaGCA AGCACCTGTA GCTACTCAAA CTGAAGAAGT AGATGAAAAC AGAACTGTTA 3900 

AAGCAATGCC TTCAGTACGT AAATACGCAC GTGAAAAAGG TGTTAACATT AAAGCAGTTT 3 9 60 

CTGGATCTGG TAAAAATGGT CGTATTACAA AAGAAGATGT AGATGCATAC TTAAATGGTG 4 020 

GTGCACCAAC AGCTTCAAAT GAATCAGCTG CTTCAGCTAC AAGTGAAGAA GTTGCTGAAA 4 080 

CTCCTGCAGC ACCTGCAGCA GTAACATTAG AAGGCGACTT CCCAGAAACA ACTGAAAAAA 414 0 

TCCCTGCTAT GCGTAGAGCA ATTGCGAAAG CAATGGTTAA CTCTAAGCAT ACTGCACCTC 4 200 

ATGTAACATT AATGGATGAA ATTGATGTTC AAGCATTATG GGATCACCGT AAGAAATTTA 4 2 60 

AAGAAATCGC AGCTGAACAA GGTACTAAGT TAACATTCTT ACCTTATGTT GTTAAAGCAC 4320 

TTGTTTCTGC ATTGAAAAAA TACCCAGCAC TTAACACTTC ATTCAATGAA GAAGCTGGTG 4 380 

AAATCGTTCA TAAACATTAC TGGAATATCG GTATTGCAGC AGACACTGAT AGAGGATTAT 4 44 0 

TAGTACCTGT TGTTAAACAT GCTGATCGTA AGTCTATTTT CCAAATTTCA GATGAAATTA 4 500 

ATGAATTAGC TGTTAAAGCA CGFGATGGTA AATTAACAGC CGATGAAATG AAAGGTGCTA 4 5 60 

CATGCACAAT CAGTAATATC GGTTCAGCTG GTGGACAATG GTTCACTCCA GTTATCAATC 4 620 

ACCCAGAAGT AGCAATCTTA GGAATTGGCC GTATTGCTCA AAAACCTATC GTTAAAGATG 4 68 0 

GAGAAATTGT TGCAGCACCA GTATTAGCAT TAT CAT T AAG CTTTGACEC AGACAAATTG 4740 

ATGGTGCAAC TGGCCAAAAT GCAATGAATC AC AT T AAACG TTTATTAAAT AATCCAGAAT 4800 

TATTATTAAT GGAGGGGTAA AACATGGTAG TTGGAGATTT CCCAATTGAA ACAGATACTA 4 8 60 

TAGTAATCGG AGCAGGTCCT GGTGGATACG TTGCAGCAAT TCGTGCAGCT CAATTAGGAC 4 92 0 

AAAAAGTAAC AATCGTTGAG AAAGGTAATC TTGGTGGTGT TTGCTTAAAC GTAGGATGTA 4 98 0 

TTCCTTCAAA AGCATTACTA CATGCTTCTC ACCGTTTTGT TGAAGCACAA CATTCTGAAA 5040 

ACTTAGGTGT TATTGCTGAA AGTGTTTCTT TAAACTTCCA AAAAGTTCAA GAATTCAAAT 5100 

CATCAGTTGT TAATAAATTA ACTGGTGGTG TTGAAAGCTT ACTTAAAGGT AACAAAGTTA 5160 

ACATCGTTAA AGGTGAAGCA TATTTCGTAG ATAACAATAG CTTACGTGTT AT GG AC G AAA 5220 

AGAGCGCACA AACATACAAC TTTAAAAATG CAATCATTGC AACAGGTTCA AGACCAATTG 5280 

AAATTCCTAA TTTCAAATTC GGTAAACGTG TTATCGACTC AACAGGTGCT TTAAfiTTAC 534 0 

AAGAAGTACC aGGTAAATTA GTTGTAGTTG GTGGAGGATA CATTGGATCA GAATTAGGTA 54 00 

CAGCATTTGC TAACTTTGGT T C AG AAG T AA CCATCCTTGA AGGTGCTAAA GATATCTTAG 54 60 

GTGGCTTCGA AAAACAAATG ACACAACCTG TTAAAAAAGG TAT GAAAGAA AAAGGTGTTG 5520 

AAATCGTTAC TGAAGCTATG GCTAAATCAG CTGAAGAAAC AGATAACGGA GTTAAAGTTA 5580 

CTTATGAAGC TAAAGGCGAA GAGAAAACAA TCGAAGCTGA TTATGTATTA GTAACTGTAG 5 64 0 

GTCGTCGTCC AAACACAGAC GAATTAGGCC TAGAAGAATT AGGTGTTAAA TTCGCTGACC 57 00 
GTGGATTATT AGAAGTTGAT AAACAAAGCC GTACGTCTAT CAGCAATATC TATGCAATTG 5760 

GTGATATCGT TCCAGGTTTA CCACTTGCTC ACAAAGCTAG CTATGAAGCT AAAGTTGCTG 5820 

CTGAAGCAAT TGATGGTCAA GCTGCTGAAG TTGATTACAT TGGTATGCCA GCAGTATGCT 5880 

TTACTGAACC AGAATTAGCT ACAGTTGGTT ATTCAGAAGC GCAAGCTAAA GAAGAAGGTT 5 94 0 

TAGCAATTAA AGCTTCTAAA TTCCCATATG CAGCAAATGG TCGTGCATTA T CAT T AGAT G 600 0 

ATACTAACGG ATTTGTTAAA CTTATTACAC TTAAAGAAGA TGATACTTTA ATCGGTGCTC 6060 

AAGTAGTTGG TACTGGTGCA TCAGATATTA TCTCTGAATT AGGTTTAGCA ATTGAAGCTG 612 0 

GTATGAATGC TGAAGATA1C GCATTAACAA TCCATGCACA TCCAACATTA GGTGAGATGA 618 0 

CTATGGAAGC AGCAGAAAAA GCTATCGGAT ACCCAATCCA TACAATGTAA TAACTGATTA 62 4 0 

TCTATAAAGA TTCAGTCATT AAAAGCTGTA GCATATGCTA CGGCTTTTTT GTTTTAGGTA 6300 

AAGTAATGTA AGGAAATTGA TTTGAGATAT CGTTAACATG TGASTGCAT GT TAT ACT AG 63 60 

CGATGCTAAT AAAAGAATTG AAATGGAGGG TTCAACAATG GAATATGAGT ATCCAATTGA 6420 

TTTAGACTGG AGTAATGAAG AGATGATTTC AGTGATAAAT TTCTTTAATC ATGTAGAGAA 64 80 

GTATTATGAA TCCGGCGTGA CGGCAGGCGA CTTTATGGGT GCATATAAAA GATTTAAAGA 654 0 

AATTGTGCCT GCTAAAGCAG AGGAAAAACA AATTTTTAAT ACTTTCGAAA AAAGTAGTGG 6600 

CTATAATAGT TACAAAGCAG TTCAAGATGT AAAAAC T C AC TCTGAAGAAC AAAGAGTAAC 6660 

AGCTAAAnAA TAATTCGTTC GAAATTAACA CAATTTAATA GGAATTTTTC TTTAAAACTA 6720 

TTGCTAATAA AGCTATATTT TGATACETTT ATCAAGTGTT AAACAAAATG TTTGATAAAA 67 8 0 

GTAAACTTAA TATAGCTTTT TTAGGTGGAA AAATAAATGA ACATAGGTAA TAAAATTAAA 68 4 0 

AATCTTAGAA GAATTAAAAA TTTAACGCAA GAAGAACTTG CTGAACGTAC AGACTTATCG 6 900 

AAAGGC TACA TTTCACAAAT AGAAAGTGAA CATGCCTCAC CAAGTATGGA ACTTTCTTA 69 60 

AAT AT T AT AG AGGTGTTAGG AACGACGCCA AGTGAATTTT TTAAAGACAG TGAAAATGAA 7 02 0 

AAAGTAT TAT ACAAGAAGGA AGAACAAGTT ATTTATGATG AGTATGATGA AGGTTATATA 7 08 0 

TTAAATTGGT TAGTTTCAAA GTCAAATGAA TATGATATGG AGCCATTAAT ATTAACTTTA 714 0 

AAGCCTGGAG CATCATATAA AAATTTTAAT CCATCAGAGT CTGATACGTT TATTTATTGT 7 200 

ATGTCAGGTC AGATAACACT TAATTTAGGC AAAGAGATAT ATCAAGCACA AGAAGAAGAC 7 2 60 

GTTTTGTATT TTAAAGCACG AGATAATCAT CGTTTGTCAA AC G AAT C AAA CAATGAAACA 7 320 

CGAATACTTA TTGTAGCGAC AGCTTCATAT TTA'AGGGGG GATCTTATTT GGAACCGTTA 7 380 

TTATCATTAA AAT C AG T TAG TAAAAGCTAT GATGATCTTA ATATCTTAGA T G AC AT AG AT 74 40 

ATTGATATTG AATCAGGATA CTTTTATACA TTATTAGGTC CTTCAGGTTG TGGTAAAACA 7 500 
ACAATTTTAA AATTAATTGC AGGGTTTGAA TATCCTGACA GTGGTGAAGT GATTTATC& 7560 

AACAAACCAA TTGGTAATTT ACCACCAAAT AAACGTAAAG TGAATACAGT CTTTCAAGAT 7620 

TATGCATTAT TTCCACACTT AAACGTCTAT GATAATATCG CTTTTGGTTT GAAATTAAAA 7680 

AAATTATCAA AAACCGAAAT TGATCAAAAA GTAACTGAGG CATTAAAATT AGTAAAACTT 7740 

TCAGGTTATG AAAAAAGAAA TATTAATGAA ATGAGTGGCG GACAAAAGCA ACGTGTTGCA 7800 

ATTGCACGTG CTATCGTAAA TGAACCAGAA ATATTATTGT TAGATGAATC TTTATCCGCA 7860 

TTAGATTTGA AATTGCGTAC TGAAATGCAA TATGAATTAC GAGAATTGCa ATCTAGATTA 7920 

GGtATTACAT TTATATTTGT aACACATGAT CCA 7953 



(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

GGCGTGATCA TACGACCGTC ATTCATGCTC ATGAAAAAAT ATCTAAAGAT TTAAAAGAAG 60 

ATCCTATTTT TAAACAAGAA GTAGAGAATC TTGAAAAAGA AATAAGAAAT GTATAAGTAG 12 0 

GAAACTTTGG GAAATGTAAT CTGTTATATA ACAGCACTAA TGATnACAAT CATTTTTTAC 180 

ATTTCTATAT GCTAATGTGG CAAGATGAGC AAAACTCATT TTGTGGATaA TGTTTaAAAG 24 0 

T CAT AC AC AC CATACACAAG TTATCAACAT GTGTATAAyT cGcCAAATCT ATGTTTTTAA 300 

GACTTATCCA CCAATCCACA GCACCTACTA CTATTACTAA GAACTTAAAA CCTATATAAT 360 

TAT AT AT AAA CGACTGGAAG GAGTTTTAAT TAATGATGGA ATTcACTATT AAAAG AG AT T 4 20 

ATTTTATTAC ACAATTaAAT G AC AC AT T AA AAGCTATTTC ACCAAGaACA ACATTACCTA 4 80 

TATTAACTGG TATCAAAATC GATGCGAAAG AACATGAAGT TATATTaACT GGTTCAGACT 54 0 

CTGAAATTTC AAT AGAAAT C ACTATTCCTA AAACTGTAGA TGGGAAGAT ATTGTCAATA 600 

TTTCAGAAAC AGGCTCAGTA GTACTTCCTG GACGATTCTT TGTTGATATT ATAAAAAAAT 660 

TACCTGGTAA AGATGTTAAA TTATCTACAA ATGAACAATT CCAGACAT T A AT T AC AT C AG 7 20 

GTCATTCTGA ATTTAATTTA AGTGGCTTAG AT C C AG AT C A ATATCCTTTA TTACCTCAAG 7 80 

TTTCTAGAGA TGACGCAATT CAATTGTCGG TAAAAGTGCT TAAAAACGTG ATTGCACAAA 840 

CAAATTTTGC AGTGTCCAcC TCAGAAACAC GCCCAGTACT AACTGGTGTG AACTGGCTTA 900 

TACAAGAAAA TGAATTAATA TGCACAGCGA CTGACTCACA CCGCTTGGCT GTAAGAAAGT 960 

TGCAGTTAGA AGATGTTTCT GAAAJCAAAA ATGTCATCAT TCCAGGTAAG GCTTTAGCTG 10 2 0 
AATTAAATAA AATTATGTCT GACAATGAAG AAGACATTGA TATCTTCTTT GCTTCAAACC 1080 

AAGTTTTATT TAAAGTTGGA AATGTGAACT TTATTTCTCG ATTATTAGAA GGACATTATC 114 0 

CTGATACAAC ACGTTTATTC CCTGAAAACT ATGAAATTAA AT T AAGT ATAGACAAT GGGG 120 0 

AGTTTTATCA TGCGATTGAT CGTGCCTCTT TATTAGCGCG TGAAGGTGGT AATAACGTTA 12 60 

TTAAATTAAG TACAGGTGAT GACGTTGTTG AATTGTCTTC TACATCACCA GAAATTGGTA 132 0 

CTGTAAAAGA AGAAGTTGAT GCAAACGATG TTGAAGGTGG TAGCCTGAAA ATTTCATTCA 1380 

ACTCTAAATA TATGATGGAT GCTTTAAAAG CAATCGATAA TGATGAGGTT GAAGTTGAAT 14 40 

TCTTCGGTAC AATGAAACCA TTTATTCTAA AACCAAAAGG TGACGACTCG GTAACGCAAT 1500 

TAATTTTACC AATCAGAACT T AC T AAAAAT AAATATAAAT AAAGGATGAC GTGATTAATT 15 60 

AAAAC G T CAT CCTTTATTTT TTGGCAAAAA TMTTCTAGG TGCGTATGTA AAATAAATTT 162 0 

GGCAGCATTT TAAACAGCAA ATAAAAGACG CCAATTAAAT T TAT G AC AAA TGTATCCAAA 1680 

ATTTAATAAG TGTGCTTATA TGCCCTTTAA ATTTAAAATT TTAATAGTCA AT AACAAGT T 17 4 0 

GAATATAAAA GTTAAACGCC GTTAAATAGC GTTAAAAAAT T G AAAAT G AC AGTATTGEA 18 00 

AAAAAT AAGA ATTAATTATT TATATGTAAA CGGTTTCTAC CTCTATTTTA AATGAAATTT 18 60 

G T G AC AAAAA AAGGTATAAT ATATTAATGA CATACAAAGA AATGGAGTGA TTATTTTGGT 192 0 

TCAAGAAGTT GTAGTAGAAG GAG AC AT T AA TTTAGGTCAA TTTCTAAAAA CAGAAGGGAT 1980 

TATTGAATCT GGTGGTCAAG CAAAATGGTT CTTGCAAGAC GTTGAAGTAT TAATTAATGG 204 0 

AGTGCGTGAA ACACGTCGCG GTAAAAAGTT AG AAC AT C AA GATCGTATAG ATATCCCAGA 2100 

ATTACCTGAA GATGCTGGTT CTTTCTTAAT CATTCATCAA GGTGAACAAT GAAGTTAAAT 2160 

ACACTCCAAT TAGAAAATTA TCGTAACTAT GATGAGGTTACGTTGAAATG TCATCCTGAC 222 0 

GTGAATATCC TCATTGGAGA AAATGCACAA GGGAAAGACA AATTTACTTG GAATCAATTT 22 80 

ATACCTTAGC TTTAGCAAAA AG T C AT AG AA CGAGTAATGG ATAAGGGACT CCATACCGTT 234 0 

TTAATGC 2347 
(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ACAAGACGTn TCTATAACTT ATCTGAAATC GCTCGTCAAG ATAAAGATTA TGCAACTATC 60 
TCATTCTTAA ACTGGTTCTT AGATGAACAA GTCGAAGAAG AATCAATGTT TGAAACTCAC 120 

ATCAATTATT TAACTCGTAT CGGCGATGAC AGCAATGCAT TATATCTTTA CGAAAAAGAA 180 

CTTGGCGCTC GTACATTCGA CGAAGAATAA TTAAACATCA C T AC AAT AG A CAGATAAATA 24 0 

TCATACGACA TGATAGGCAT TTGGGTCACT TACAATAACC CAATGTCTAT ATTATTTTGC 300 

TTTACGGAGA TCACTAGATT CATTTTCTGA ATCATTGATC TGCGTTTTTT CATTTTCAAG 3 60 

GCTAATTATT GTATTTTTAG TCATTTATTT TTTAAACTAC 2LATGTTAAT AACTCTAAAT 4 20 

TTGATGTTGA ATTAATTTGA CGATTTTAAA GCATATCATC ATTTACTTTT TAATCAGAGT 4 80 

TACATCCAAA TGATAGATTT CACGTTATAC CTTCACGTAT AATATTATGT ATCGTTTGTA 54 0 

AGCAAATGAC TAAAAGTCTA TTAATATATA CATTTAATTA ATTGAAAGGA TTGACTACAT 600 

GATACAAGAT GCGTTTGTTG CACTTGATTT TGAAACAGCA AATGGTAAAC GTACAAGTAT 660 

TTGTTCTGTC GGAATGGTTA AAGTCATTGA TAGTCAAATA ACAGAAACAT TTCATACTCT 720 

TGTGAATCCG CAAGACTATT TTTCACAACA AAATATTAAA ATTCATGGCA TACAACCAGA 7 80 

AGATGTTGAA AATGCACCTA CGTTTGACTA CGTATTTCCA TATATGATGC AATTTATTGc 84 0 

aGATTTACCT GTTGTCGCAC ATAACGCGGC ATTTGATATG AACGTCTTAC ATCAAAGCAT 900 

TCAAAATATT GGTTTACCAA CTCCAAATTT AACTTACTTT TGTAGTTATC AACTTGCTAA 9 60 

AAGAACCGTT GATTCGTATC GATACGGTTT AAAACATATG ATGGAGTTT ATCAATTAGA 102 0 

TTTTCATGGT CATCATGATG CAT T G AAT GA TGCCAAAGCA TGCGCAATGA TTACTTTTAG 1080 

GCTACTGAAA AAT TAT GAAA ATTTAACATA TGTAACTAAT ATTTATGGTA AAAATCTAAA 114 0 

AGATAAAGGC TAGGACTAAA TAAAATACTC CCTTCAAAAG TAAGCATTGT AAAAATGTAA 1200 

ACTTTGCAGG GAGCTTTATT TTATATAAAG TCATATATCG TCATATTTTT ATAAGTTGAT 12 60 

TGTTCTAAAT TACCTACAGT GAC AC C AAT A AGTCGAATTG GTACATCAGG GTCTTTTAAA 132 0 

TCGTTATAAA GTAAATATGC AAT AT T AT AA ATATCTTCTT CAGAACTAAC CGAATCTCTT 138 0 

AAACTCATCT GTTTAGATAG CGTTTCAAAT TGATAAGTTT TAATTTTAAC CGTTACAGTT 1440 

TTAGCTGACT TCTGTAATTT ATTTAGACGT TCAGCTGTTT TACCTGnACA AT T C C CAT AC 15 00 

TTTTCTTAAA ATCTCTTCAT CATCATTCAC GTCTGTTGCA AATGTGCGTT CAGTCCCTAC 15 60 

TGATTTTCTT ACTCTTGATG ATTTCACTTC ACTATGGTCA ATACCGCGTG CCTTGTATA 162 0 

TAAACCCCGA CCTCTTTTTC CAAACAAACG TATTAATTCA AATTCCGTTT TCTCATATAA 1680 

ATCTCTACCG T T AAAAAT AC CATTATCATG CATTACTTTT TTGGAAGCTT TACCTACGCC 17 4 0 

TGGaAAATCT CCAATATCCA ATGTCATCAA AATATCATGG aCATTTTGAT AATCAATCAC 18 00 

AGTCATACCA TCAGGTTTAT TCATACCACT CGCTAATTTA GCTAAAAATT TGTTATAAGA 18 60 

AACACCTGCA GATGCTGTTA AATGTGTCTG CTCTAGAATA TCTTTTCTAA TATACTGAGC 192 0 
AATTTTCGAA GCAGGAAGGT CTGGTCTCAC TAATTCTGTA ATATCTAAAT ACGCTTCATC 1980 

CAATGACATC GGTTCTACCT TATCTGTATA ACTTCGGAA ATAGACATAA TCTGCGCAGA 2040 

TGTTTCTCGG TAAGCACCAA AATTACTTGT GACAAAGTAT CCATTTGGAC ATAATTTATG 2100 

CGCTTGTGAC ATAGGCATTG CTGAATGGAC GCCGTATTTT CGTGCTTCAT AGGATGCCGT 2160 

AGAGACAACA CCCCTACTGC TTGCTTTACC ACCAACAATG ACTGGTTTCC CTTTCAATTT 2220 

GGGGTTATCT CTCATTTCGA CTTGTGCAAA AAAATAGTCC ATATCTATAT GAATAATTCG 2280 

TCTCTCAGTC AAGTGCTCAC CTCCCTACTA ATTTTTACTT TTATAACGCA CAAAAATATC 2340 

TCAACATAAT TATACGCTGT GTACGATTTT TTTACATAAA TCTTGCACTT AGCGATAACT 2 4 00 

ATATTGaGAT AACTACAAGT TGTTATaAAA TCAATTGCTA TTTAAGCATG AT GAT G AAGA 24 60 

CGATTGAGTA AGAAAACATA GGTAATCTGA AATAATTCAA GCAAATTCAT TTTGTTGGTA 2520 

TCATCATATT AAAATTTATT ATTGAGTCGG CTTTTGATGA TACAAATAAA TACTATCTTC 2 58 0 

ATCTTGTTGA AC AT AT T T AA AACTACTAGA TATTGTTAAT GcAAfiAGCA GAATATCTAT 2 64 0 

AAAGCAATAA GCGGTATGCA TACTAAACAT AAAAATAAGT GATGAATAAC CAAATACCTT 27 00 

AATTAAAATA AGCAAGCCAG TACTTAATAG GATTAGTGGT GACAGCATAA TAATTGAGAA 27 60 

TTGCCATTTG TTGAAGCAAG CATCTGCTGT TTGGAATAAG ATTCTGTCTT TTTTTATATT 2820 

AAACATAGGT TTGCTATCTT TTTTAAATAA AAGAAATAAT GCTCTATGGA TAAGTTCATG 2880 

TAAAATCAAT AAAATAATGA ATCCAGCAAA CCCATATACA AG AT T GAT G A TGATATTTTG 2 94 0 

ATCGACAACC GCTGTGACAC CTAACGCCCA CTTATACGTA AATAAAATCA CGAATAACGC 30 00 

AATAACAAGT TGCAAGATAA TAAACCTfCG CATTTGAAAA TTATTTGTCG TTAAATCAAT 30 60 

TTTATGCATT ACCAACCCTC C C GAT CAT GA CATTCTTATT CTTCTTTAAA TATAGTATAC 312 0 

AATGTCACAT TTAATTTAAA AAG T T CAT AT CAAGAAAGTA AATTGGCTGT AATAAAATTT 3180 

TAATATACGA CTTCTTTCTT CACTTATTAA GGCGAAATTT TATCtCAAAT CAGTGCGCT 324 0 

ATTTCAAATT GAATAATGCC ACTGTCTCAA CATGTGTTGT TTGTGGAAAC ATATCTACCG 3300 

GTGTTACCTC TTCAAGTTGA TATTTTTCAG CTAATAATAA TGCATCACGT TGCTGTGTTG 3360 

CGGGATTACA TGAAATATAG ACAATACGCT TAGGTTCTAA TGTAAGCAAA GTCTGAATAA 34 20 

ACGTTTCGTC ACAGCCCTTT CTTGGCGGAT CAACCATTAC AACATCTGGT TTAATCCCTT 34 80 

GTGCTTTCCA TTGTAAAATA ACTTCTTCAG CTTTCCCACA GACAAAAGTT GTATTATTGC 354 0 

ATTGGTTTAT AGTCGCATTT TGTTGTGCGT CTTCAATTGC AGAAGGTACT ACTTCAACAC 3600 

CGTATACATG TTTTGCAAGT GGTGCCATAT ATAfECCTAT TGTTCCAATA CCACAATAGG 3660 

TATCTAATAC AACTTCATTA CCTGTCAATT GCGCATACTC AATTGCTTTA TTATATAATT 3720 
TCTCTGTTTG TTCAGAATTA ATTTGGTAGA ATGACTGATC ACTTATTTTA AATGTACTAT 3780 

CTGTTAATTG ATCAATAATT GTATCTTTAC CATATAGCGT TATAGATTGA CGTCCCATAA 38 4 0 

TAACATTAGA GTGGCTATCA TTAATGTTTT GTTTAATGCT TGTCACATTA GGAAATGCAT 3900 

CTAATATCTT CTCAACAACA GCATTTTTTT GTGGCCACTT TTTACCATTA GTTACAAAAA 3960 

TAATCATCAT TTCGTCTGTA TGATATCCTG TTCTTACAAC CAAATGTCTC ATTAAACCTT 4 02 0 

TTTTCAATTG TTCTTGATAA ATACTTACAT TTAAATCTTT TAAAATAGAT TTAACTTCAT 4 08 0 

TCATCACTTC TTGATGTTGT GAATCTTGTA TTAAACAACT TTCCATGTCA ATAATGTCAT 414 0 

GGCTTCTTTG AC G AT AAAAG CCCATAATAA CTTCATTCTG TTCATTCTTA CCAACTGGAA 4 2 00 

TCTGGGACTT GTTTCGATAT CTCCAAGGAT CTGTCATGCC ACTGTATCG TTAATCTTAG 4 2 60 

AATTATCAAA ATGCGCTTTT CGCTGAAACA AATTAATCAC TTGTTCCTTT TTCATTTCAA 4 320 

GTTGTGCTTC GTATGATAAG TGTTGAAGTT GGCACCCACC ACAACGTTCA TAATATATAC 4 38 0 

ATGGCGGTGC AACACGATTC TCACTTCTCG TATTTATATT AAGCAACTTA CCTATCGCAA 4440 

AGTTCTTTTT TACTTTGATA ATTTTATATT CAATTTGTTC ATTAATTAAA GCTTGTGGTA 4 500 

TGAAAATAGG AAAGCGATCT ATTTTTACGA CACCATGGCC TTCATGCGTT AAATCAACAA 4 5 60 

CTGTTCCCGT TTTTATGTCA TTTTTAGCTA TTGCTTGCAA AATTTTACCT CCAAAATGAA 4 620 

CAGGTTAGGA ACAAAATTAT GCGCTTCCTA ACCTGCCATT ATATATTTCA CTATTTCTGT 4 680 

TTATTCTTCG ATTAAATTGT CAT C AAC AT G AT CAT TAT T T ATTAACTCTT CAT T T AC AAT 4 74 0 

ATCATTAGGT GCAAAGACAT CTATATGACG TTCTAGATTT AAGAAATTCG CTGGTAATTT 4 800 

ACCACCATAT TCTCCATCTA CATTTAGTTG TAAGTCTGTG AATGATGAA TATTAATTGC 4 8 60 

CTTTGCTTTT TCATAAATAA CTTTAGGATG CTTAGTATGT TCTCCTCTTG AAGCTAAAGT 4 920 

CATAATATGA CCAAGTTCTG CAAGGTTTGA TTTTTCAACT ATAATTAACG TAAAATAGCC 4 980 

GTCATCTAAC TTAGCGTCCG GCACTAATTT TTCAAATCCT GCCATTGAAT TTGTTAAACC 504 0 

TAAAAAGAAT AATAATGCTT CTCCTTGGAA AACATTACCA TCATATTCAA TTCTTAAATC 5100 

TACAGCTTTC ATTTGAGGTA ACATTTCGAA ACCTTTGaTG TAATAAGCAA AT GG AC C AAC 5160 

AAT AG AT T T C AATTTACTCG GTGTTTCATA AG AG AC T T G C GTCAATTGTC CGCCTGCAGC 5220 

TAAATTAATA AAGTATCGAT TATTCATTTT ACCAATATCT ACTTTAGTAG AATGACCTTC 52 8 0 

AAT GAT G AC A TCAAGTGCCC CCATGATGTC ATTAGGTATA TGCAATGCAC GTCCAAAGTC 534 0 

ATTAACAGTA CCCATAGGAA TGACACCTAG CTTAGGACGA TTAGGCTTTT CTGCGATACC 54 00 

ATTAACTACT TCATTTAATG TTCCATCACC ACCTGCAGCG ATTAATACAT CATAATTTC 54 60 

ATGCATAGCT CTTTCTGCTT CAAGTGTGGC ATCACCTATT TTCTCGGTTG CATATGCACT 5520 

CGTTTCATAT CCCGCTTTTT CTAATTTTAT TAAGGCATCA GGTAATTCTC TTTTAAATAG 5580 
CTCTTTACCT GATGTCGGGT TATAAATGAT TCTAGCACGT TTCCTCATAT CTTATCCCTC 5640 

TACTTAAAAT TCATATATTT TAACTTCATC TTTGTTTCGT CTAATAGGGA GTGGGACAGA 5700 

AATAATATTT AACAAAATTT ATTTCGTTCT ACCCCAACTT GCATTGTCTG TAGAATTTCC 57 60 

TTTCGAAATT CTCTATGTTG GGGCCCCACC CCAACTTGCA CATTATTGtA AG c T G AC AG A 5820 

AAGTCAGCTT CTTTGTTTGG GGGCCCCGCC AACTTGCAIA TTATTGTAAG C T G AC AG AAA 5880 

ATCAGCTTCT ATGTTGGGGC CCCACTAGAA TTGAAAAAAG CTTGTTACAA GCGTATTTTC 594 0 

TTTCAGTCAA CTACAGCCAA TATAACATTG TAGTGCCTAG GACATTGAAT TTATGACCCA 6000 

GGCTCAGTCT TATTTCATCA TTCTTAATAT CGTTAAAGAC CAACTTGTAT C T T AAAC AAA 60 60 

TACTATCTCA ATATGTACAA AGCTTGTTAT TTATTCAGCA TTTTTTGCCG TTCTTCATTA 6120 

TAt AGcTTCG TCAGTTATGC TATTTTACCT TTAAAATGAT GTTGTAAATA TAATGTTGTC 6180 

TTTATACAAA AATGTTTTAT ACCTTTTTAT TATACAGTTT GCCTTAATAT AATTAnAAGT 62 4 0 

AACGCATTAA TAAAATTAAT ATTTTTACCA TTAACATGTA CAATGAATAA AGTTAAAAGT 6300 

AATTTGACTT C TAT AG AT AT AAATAAACCC TCGATTGCAT CTAAGTCAGC AATCAAGGGT 6360 

TTATTTTTTA AATCTTCATA GTTTGATGAT TTAAATTATC TTTTATCTAA TTCTTGTTTT 64 2 0 

AATAGTTGAT TTACTAATTG TGGATTAGCT TGACCTTTAG ACGCTTCAT AATTTGACCA 64 8 0 

ACTAAGAAGC CCATAGCTTT GCCTTTACCA TTTTTGTAAT CTTCAACTGA TTGTTCGTTA 654 0 

TTGTCTAATG CTTCATTTAC AAAT T T TAG A AGTGTTGCTT CAT C AGAAAT TTGAACTAAG 6600 

CCATTATCTT CCATAATCTG TTTAGCATTA CCACCTTTAG CTGCTAACTC TGGGAAGACT 6660 

TTCTTCGCAA TTTTACTGCT CATTGTTCCG TCTTCGATAA GTTTAATCAT ACCTGCTAAA 6720 

TTTTCTGGTG TTAATTTAGT ATCTAATAAT TCTACTTGAT TTTTATTTAA ATATTCGTTT 67 8 0 

ACGCCACCCA TTAACCAGTT AGATGTTAAT TTAACATCTG CACCGTGTTC AATTGTTGAT 68 4 0 

TCAAAGAAAT CTGACATTTC TTTAGTOAAT GTTAATACGT GTGCATCGTA TGCAGGTAAA 6900 

CCTAATTCAT TTACATACTT AGCTTTACGT TCATCTGGTA ATTCAGGAAT TGTCTGACGA 6960 

ACACGCTCTT TCCAAGCATC ATCAATATAT AAAGGTACAA TGTCAGGCTC TGGGAAGTAA 7 02 0 

CGGTAATCAT CAGAACCTTC TTTAACACGC ATTAAAATTG TTTTACCTGT AST T CATC A 708 0 

AATCGACGTG TTTCTTGTCC GATTTCTCCA CCATTTAACA ATTCTTCTTC TTGGCGTTTT 714 0 

TCTTCATATT CTAAACCTTT ACGTACATAG TTAAATGAGT TTAAGTTTTT CAATTCGGCT 7200 

TTAGTACCAA ATTTTTCTTG ACCATATGGA CGTAAAGAGA TGTTAGCATC AC AAC G T AAA 7 2 60 

GATCCCTCTT C CAT C T T AAC GTCTGATACA CCAGTGTATT GAATAATTGA ACGCAATTTT 7 320 

TCTAAATATG CATATGCTTC TTTAGGTGAA CGAATATCTG GTTCAGATAC GATTTCAATT 7 380 
AGCGGTGTAC CTTGACGGTT CAAGTCAACT AATGAATACT CACCTTTATG TGTTGACTTA 7440 

CCAGCATCTT CTTCCATGTG AAGACGAGTA AT AC GAT TC GTTTTGTTTC ACCGTCGACT 7500 

TCGATATCGA TATATCCATT TTCACCAATT GGTTGATCAA ATT G AG AAAT TTGATATGCT 7 5 60 

TTTGGATTAT CTGGATAGAA ATAGTTCTTA CGGTCAAACT TAGATTCTGT TGCGATTTCC 7 620 

ATATTTAGTG CCATTGCAGC ACGCATTGCC CAGTCTACTG CACGCTTATT AACAACTGGT 7 68 0 

AAGACACCTG GATATGCTAA GTCGATAACA TTTGTATTTG AGTTAGGTTC TGCTCCAAAA 7 74 0 

TGCGCTGGTG ATGGAGAAAA CATTTTTGAG TCCGTTTTTA ACTCTACGTG AACTTCAAGT 7 800 

CCTATAACTG TTTCAAAATG CATGATTTCC ACTCCTTATA ATTTTTCATA AACGTCATGT 7 8 60 

AAAT TGT ATT GTGTTTCATA TTGATAAGCG ACACGATATA ACGTTTTTTC ATCGAATGGT 7 92 0 

TTACCAATGA ACTGTAAACC GATTGGTCGG CCATTTGATT GTCCACAAGG AACAGAAATA 7 98 0 

CCAGGTAATC CAGCTAAGTT TACTGGTGTT GTTAATAAAT CATTGGCATA CATTGTTAAT 8 04 0 

GGATCATCAA TTTCTTCACC TAAATTAAAC GCaGTgTnAG GGCTGTTGG ACCAACTACT 8100 

AC AT C AT AAT TTTCGAATAC TTTATCAAAG TCATTTTTAA TCAATGTTCT AACTTTTTGA 8160 

GATTTTTTAT AGTAAGCATC ATAGTAACCT GAACTTAATG CAAATGTACC TAAGAAAATA 8220 

CGACGTTTTA CTTCTTTACC GAAACCTTCA GATCTTGACA TTTTATATAA TTCTTCTAAT 8280 

GAATGAGCTT CTTTAGAATG AT AAC C AT AA CGAATTCCGT CAAAACGAGA AAGGTTTGAC 8 34 0 

GAAGCTTCTG ATGATGCAAT CACGTAATAT GATGGAATAC CAAATTTAGT ATTTGGCAAT 8 4 00 

GATACTTCCT CAACGACAGC ACCTAAAGAT TTTAAAGTTT CTACAGCGTT TTGAACTGCT 84 60 

TCTTTTACGT CATCAGCTAC ACCTTCACCT AAGTATTCTT TAGGTAATGC AACTTTTAAT 8520 

CCTTTAATAT CTTTACCAAT TTCAGATGTA AAGTCTACAT CAT C AAC T GG TGCACTTGTA 8 58 0 

GAGTCATTAA CATCTGCACC AG AAAT AG C T TCTAATACGA TTGCATTATC TTTTACATTT 8 64 0 

CGAGTCAATG GACCAATTTG GTCTAATGAA GATGCAAAAG CAACTAATC AAATCGAGAT 8 7 00 

ACACGACCGT ATGTTGGTTT CATACCGACA ACGCCACAAT ATGCAGCCGG TTGTCTAATT 8 7 60 

GAACCACCTG TGTCTGAACC TAAGCTAAAT GGTACTAAGC CAGCTGCAAc TGCTGCTGCA 8820 

GATCCACCTG ATGAACCACC TGGCACTGCT TTATGGTCAA ATGGGTTAAC TGTTTTTTTG 8 88 0 

AAAT AAG AT G TTTCTGTTGA ACCACCCATT GCAAACTCAT CCATATTTAA TTTACCGATT 8 94 0 

AAAACGGCAT TTTCATTATG TAGTTTTTCC ATTACAGTAG ATTCGTAAAT TGGCACAAAA 9000 

CCTTCTAACA TTTTACTTGC ACATGTTGTT TCTAATCCGT TTGTAATAAT GTTATCTTTT 9060 

AT ACCC AT T G GAATACCAAA TAATTTGCCA TCCATTTGAT CTTTTGCTTG TAATTCATCC 9120 

AATTCTTGCG CTTTTTTGAT TGCATTTTCT TTATCCAGCG C T AG AAAAG A CTTAATTGTT 918 0 

GGATCAGTCT CTTCAATTGC ATCATATATA TCTTTAACAA CATCAGATGG TTTGATTTTT 924 0 
TTGTCTTTTA TTAAAGTTAA TAAATTCTCA ACCGATTCGT AGCGAATGCT CATCTACGC 9300 

GTCCTCCTCA TTCATGATTG TAGGCACTTT AAATTGTCCA TCTTCTGTTT CTTTGGCATT 9360 

TTTCAAAGCT AATTCTTGTG GAATACCTTT AATTGCTTTA TCTTCACGTA AAACGTTTTG 9420 

TAAATCTAAA ACGTGATATG TAGGTTCAAC GCCTTCTGTA TCAGCGCTAT CATTTTGTTT 94 8 0 

TGCAAAATCT AAAATGCTTT CTAATGTGTT GGCCATTTCT TCCGTTTCTT CAGGAGAAAT 954 0 

TTGAAGTCTT GCAAGATTCG CGATATGCTC AACTTCTTCA CGTGTTACTT TTGTCATTAA 9600 

TAAAAGCCTC CTTTAAGTCA TTCATCACTA AATTGTATCA AATTTCCAAT TAAAAATCTA 9660 

AGTATTTATG AGGTGCTACT TTAATTTCAT ATAAACTGA TAAACATTAT CATTCGTTTA 97 2 0 

T C AAAT CAT T TTTTATGAAA ACAACACTCT TTTAATATTA GACAACCCAA TTCAATATTA 97 8 0 

TGATTATGCA ATTTTAATTA TAAAAATTTG CTCATATCCA AAAAATACGA AAGCGCTTTC 984 0 

TATATTGGTA TGCAAGTATT TCAAAAAGAA TAAATTTAAT TTTCCTACTT TTCTAAACAT 9900 

TTATCTTTAT GTATAATGTT TTCAAGTAAC T AAAT T AT AA AT T AAAT AAA GGGAGTGTTT 9 9 60 

ATCATGCTTA CAATGGGGAC AGCATTAAGT CAACAAGTAG ATGCCAATTG GCAAACTTAT 10 020 

ATTATGATTG CCGTCTACTT CTTGATACTA ATCGTTATTG GCTTTTACGG TTACAAGCAA 1008 0 

GCAACTGGTA ACCTAAGCGA GTACATGTTA GGTGGACGTA tATTGGACCG TATATTACTG 1014 0 

CATTATCAGC TGGAGCTTCA GATATGAGTG GAT G GAT GAT TATGGGGCTA CCTGGTTCTG 10200 

TCTATAGCAC TGGTCTATCA GCTATGTGGA TTACAATCGG TTTAACATTA GGTGCTTATA 10260 

TAAATTACTT TGTTGTTGCT CCTAGACTTC GTGTTTATAC CGAATAGCT GGAGATGCAA 10320 

TTACATTACC AGATTTCTTT AAAAATCGTT TAAACGATAA AAATAATGTG TTAAAGATTA 10380 

TTTCTGGATT GATTATCGTA GTATTCTTTA CAT TAT AT AC ACATTCTGGT TTCGTATCTG 104 4 0 

GTGGTAAACT ATTTGAAAGT GCTTTTGGAT TAGATTATCA TTTCGGTTTA ATATTAGTTG 10500 

CTTTCATTGT CATTTTCTAT ACTTTCTTTG GTGGATATTT AGCTGTATCA ATTACAGATT 10560 

TCTTCCAAGG TGTCATTATG TTAATTGCGA TGGTTATGGT CCCTATTGTT GCTATGATGA 10 620 

ATTTAAACGG CTGGGGAACG TTTCATGATG TAGCAGCTAT GAAACCTACA AATTTAAATT 10 68 0 

TATTTAAAGG GTTATCATTT ATAGGAJTTA TCTCTCTATT TTCATGGGGA TTAGGTTATT 10740 

TCGGTCAACC TCATATCATT GTAAGGTTTA TGTCTATTAA ATCACACAAG ATGCTACCTA 10800 

AAGCTAGACG TTTAGGTATT AGCTGGATGG CTGTTGGTTT ATTAGGCGCT GTGGCTGTTG 108 60 

GTTTAACAGG TATTGCATTC GTACCTGCTT ATCATATTAA ACTAGAAGAT CCGAGACAT 10920 

TATTCATCGT GAT GAG T C AA GTACTCTTCC ATCCTCTTGT AGGTGGTTTC TTACTTGCTG 10980 

CGATTCTAGC TGCAATTATG AGCACGATTT CTTCACAATT ACTTGTAACA TCTAGTTCAC 11040 
TAACGGAAGA CTTTTATAAA TTAATTCGTG GTGAAGAAAA AGCTAAAACG CACCAAAAAG 11100 

AATTTGTTAT GATTGGAAGA TTATCTGTAT TAGTTGTAGC AATTGTTGCC ATCGCGATTG 11160 

CATGGAATCC AAAC GAC AC A ATTCTAAACT TAGTAGGTAA CGCTTGGGCC GGATTTGGTG 1122 0 

CATCGTTCAG TCCACTTGTG CTATTTGCAC TTTACTGGAA AGGTTTGACA CGTGCCGGTG 11280 

CTGTAAGTGG AATGGTTTCA GGTGCCTTAG TCGTATCGT TTGGATTGCA TGGATTAAAC 1134 0 

CATTGGCACA TATCAACGAA ATATTCGGCT TATATGAAAT TATTCCTGGA TTTATTGTAA 114 00 

GTGTAATCGT TACATATGTT GTAAGTAAAC TTACTAAAAA ACCTGGTGCA TTTGTTGAAA 114 60 

CTGACTTAAA CAAAGTTCGT GACATCGTTA GAGAAAAATA ATTCATAAGT CTTAACAAAT 11520 

TAAAAAGGTA CTAATGTTAA TCAAAATTAT GACTAACATT GGTACCTTTT TATTATCTTT 11580 

AATTTACAAT TAATTACTGT AAATATGAAC TTGCGGTTCT TTGTCATCTT TTGTCTTACT 11640 

AATTAAAGCA CGTGGTTGGT TACCATCTTT AATACGAATT TCATAGTTAT CGATTTTATC 11700 

GAAATATTTA TTCGCTTGTT CAGTAACGTA CTGTGTAATA CCAATTGTTT CAGCTTGTCC 117 60 

ATAGTAATCG ATTGGTAAAT CTACTACTAA TCGTTGTGGC TTTTTATCAA CAAATTTAAC 11820 

TTTCCCTACT GCTTGTGTGA AATTAGAAAA ATATGATTGC AAATTATCAT TAAATTGCTT 11880 

GAAATTATTA TTTAAATTTT CATCATAATC TGCTGCTGTT GAGAAGGTA ATAAAGCTGA 11940 

TTTTTCATTG ATATTATGCC ATTCATTAAG CTTTGTTTGA CTCTTTTCTG CAGTCGCTTG 12000 

AGTGATAAAT TCACCTGGTG TGATTGAATC TTCACTTGAT TGCTTATAAA TTGCAAAATG 12060 

AATTGGTATA TCTTTTAAAT CATCATTTTC ACGTAACCTT GATAATATCT CACTAGCCAT 12120 

TTGTTTACCT TGCTTTTTAA CTcGCTATCA TCTAGTTTTT TAC TAAAAGT CGATCCATCT 12180 

TTTTCTTTTT TATAGTAATA AACACTATTC ATAGCTAAAC CAATCGTCAT ACCTTTAATA 12240 

TTCTTACCTT TTGTATCTCC AC CAC C AT AA AAATCTTGCT CTAAAATGTT AGATAAATAG 12300 

GCTGGTGATT TTTCTGCAAT CTTTTCAGGA TCTGTTTCAC CTtCGTGTGA TGGATTAAGT 12360 

CCTAAATTTT CATTCGCTTT CTTGTCTTTT TTATCTTTTT C AG AC AT T T T ATCGATTTCA 12420 

CGTTTTGTAT ACTTAGGATT TAAATAGGCA TTAATTGTTT TCTTGTCCAA AAATTGACCA 12480 

TCTTGATACA AATATTTATC TGTTGGAAAT ACTTCTTTAC TTAAGTTCA TAAACCATCT 1254 0 

TCAAAGTCGC CGCCATTATA ACTATTTGCC ATGTTATCTT GTAAAAGTCC TCTTGCCTGG 12600 

CTTTCTTTAA ATGGTAACAA TGTACGATAG TTATCACCTT GTACATTTTT ATCCGTTGCA 12 660 

ATTTCTTTTA CTTGATTTGA ACTATTGTTA TGTTTTTGAT TATCTTTTCC AGCCTGGTCA 12720 

TCCTTATGGT TAC C ACAAGC AGCGAGTATA AAGATAGCTG TAATCAATAA TACTAATGTA 12780 

CGCTTCATCG ACATACCCCT CTAACTATTT AATTCATTTT GCTTATCTAC AAATTGTTGC 12840 

TCTGTCCAAA TTTCAATACC TAAACTTTGT GCTTTTGTTA ATTTTGAACC TGCATCTTCA 12900 
CCAGCAATAA CGACATCTGT ATTTTTAGTA ACGCTACTTG TAACTTTAGC ACCTTGTGAT 12960 

GCAAGCCATT TAGATGCTTC ATTGCGTGTC ATTTGATGTA GCTTACCAGT CAG T AC T AT C 13020 

GTTTTACCAC TAAATTCAGG ATGTCCTTCA ATATCTGATG TTTTGATACC TTTATAAATC 13080 

ATATTAACAT GTTTATCTTT TAATTTTTGA ATTAAAGCAC GAATATCTTC ATTTTEAAA 1314 0 

TAAGTAACTA CAGATTGTGC TACTTTATCA CCTATATCAT GAATTTCTAC TAATTCCGCT 13200 

TCAGTTACCG TTAGTAATCG ATCTATCGTT TCATATTTTT CTGCTAACAC TTGGCTCGCT 132 60 

TTAACACCTA AATGCCTAAT ACCTAGACCA AATAATAAAT TTTCTAAAGA GTTGTCCTTA 1332 0 

GCTTGTTGAA TGGCAGCTAA TAAATTATCA ACTTTTTTCT GCCCCATTCT GTCTAAAGGT 13380 

AATAAATCTT CTTCTGTTAA ATAGAAAATA TCAGCAACAT CTTTAATTAA TTCGCTTTGA 13440 

TAAAGCTGTT GAATAATTTT AGTGCCTAAA CCATCAATAT TcATGGCTTG TCTTGaTACA 13500 

AAGTGnATCa ATCCtTcAAC AAGTTGTGCT TGGTCATTT GG 13542 
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

CAGTAAACAC CTCTGATTAC GAATATTTAT ACATTTATTT TAACACATGC ACTGATTTAC 60 

GACTACTAAA CACCTTTACG TAAAAAGGGT AAACATGGTT TATCTATCTT GGTTATCTAT 120 

TTATAAATAT TTnTCATATT ACGCATAACA ATTGCTTAAA ATATGTATAA AAATGAATA 18 0 

ATGTGTAATA AACTTGCTAA TTATTAGATT TAATAAGCGT CAATTGTTTG AACATATTtA 24 0 

ATTAAAATCA CATTGATATC ACAGATACGA ATATTGTCGT ATAGAAATTG AAAATTCTAT 300 

TTTTTAAATG AAAGTCTTCA ACATAATTTT AAGTTTCAAC AT G AG AAAAA TCGATTAACA 3 60 

AACAACGTCA GTTGAATATG CCTTTTGAGA CATTTCAAAC TTTACAATTG TTGCTAATCG 4 20 

ATATATTTGC TTTTAGTGAT CCCTGCTATA AAATAAATCA ACGATTTCTA ATAAGTGTTT 480 

TGTATTGAAT TGTTCATCAA TTTGCGTTAG TTCATCCACT GCTGCGTCTC TATGATAAGT 54 0 

CAATTTATCT TCTGCGCCAT CTTTCCCTAA TAAACTCACGTACGTACTTT TATTATTTTC 600 

AAGATCGCTG CCCACTTTTT TACCTAACTT TGCTTCATCA CCATAGCAGT CTAATAAATC 660 

ATCTTTAATC T GG AAC AT C A TACCTAAATG ATAACTATAA CTTTCTAAAT GTTCTTTAGT 720 

TGTATCATCG ACATTAGCGA TATCTGCTGC ACTCATAACC GCAAAAGTTA ATAATGCTCC 780 

TGTTTTTGTT TTGTGTATCA TTTCCAAAGT TTCAAGATCA ATTGGTTGGC CTTCGCTTTG 840 

CATATCTAAC ATTTGACCGC CGACCATTCC AACATGACCA CTTGCTATTG ACAGCCGTTG 900 

TAGAACTTTT ATTTTTACTT CATCAGTTAA TCTATCATCA CTTGAAATAA GTTCAAATGC 960 

TTTAGTTAAT AAAGCATCAC CTGCTAATAT CGCAGTCCAC TCACCATATA CTTTATGATT 1020 

TGTTAATTTT CCTCGTCGAT AATCATCATT ATCCATCGCT GGTAGGTCAT CATGAATAAG 1080 

TGAATATGTA TGAATCATTT CTAGTGCAAT TGCGCTCTTC ATACCTAACT CATACTCGGT 1140 

ATTTAGTGAA TCTAAAGTGA GTAATAACAG AACTGGTCGG ATGCGTTffiC CTCCAGCATT 1200 

TAATGAATAC AACATACTTT CTTCTAGCTG AGTATCCATT ACTGATTTAT TTATCGCAAC 1260 

CGATAATTCA TTATTGACTT CATCTATTAA TTTATTCATC GGTAGATTCG TCATTTTTTA 1320 

CATCCTCAGC TTCTTCTTTT ATTAAGTCAT TCACCTTTTT TTCGGCATTT TTTAAAGTTG 1380 

TGTCACAAGC TGCTGATAGT TTCATACCAC GTTGATATAA ATCTAATGAT TCCTCTAAAG 1440 

ATACTGTTTC ATTATCTAAT TTTTGAACAA TTTGCTCTAA TTCTTGCATC ATTTCTTCAA 1500 

AACTTTGCGT TTCTTTAGTC ATTATTACAC CTTACTTTCG TAACTTTTGC ATCTACTAAG 1560 

CCATCTTTCA TTGTTAACGT CAATTGATCA TTTTCTGTTA AATCTTTAGT ACTCGTAATG 1620 

ACTTCGTCTT TTTTATTAAC AATTGCATAT CCACGCAACA TTGTATTAGT TGGACTTAAA 1680 

TTGTTTAAGT TTTCTACTTT ATTTTTCAAA TCATTTTTAT AACTTAATAT CTTAGAATTC 1740 

AATAATTTAA CAAGTTGGTT TGTCAATTGA AGATTATnTT GTTGTTCTTG ATTASACTA 1800 

CTTAGTAATG CTTTTAAATn ATAACGTTGG TGCAACAGCA TTAAATCGAG GCCCCGGTGG 1860 

TCCAAAGTTG CCCGAATTnG TGGTTTCAGG CCC 1893 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

AAAATATATT CCTTCACTTA ATATTCAATT AGAGAAAAAC ATGGTGATTG TAATATGTTG 60 

TGCAATATTT CTGGGTGTTT TAATACTTTT TTTATTTCTG AATCGTAAGC TAAGGTTGGA 120 

AATTTATAAT AATAACTCTA GTAAAGGGAA AATAATTTTA TTTCCTTCAT TAAAAAACTT 180 

TTGTTTCACA ATATTTTATT ATTTTTTATT TGGCGGTCTT TCAATAATGG CTCTAAGTAT 24 0 

GTTATTAACT TTAAATCCTC AAAATATAAT AGCETTTATT GGTTGGTTGG TAATGACTGC 300 

AGGTTTCTTT CTGTTAAACA TGTCATCGAT TAT T G AC AAA AAAAT T TAT G TATTATCTAA 360 
AACTAACACG GTGGAAAAAT GATGGTTTAG CTGGATTTAC TGCAGGTTCT ATTTCGGCAA 420 

TACTTGTATA TTGGACCAAT CAAAAAAATG AATTTGGAAT AAAAGATAAA AACGATTGS 480 

TAGGACATAA ACTAGACGTT GGTATAGATG CTGTAGAAAA ATCTGCAGAA AAAACAGTAG 540 

ATGGTGTTGA AAATGTCATG GTGAAGCTTC AAAAAGTATT TCTAATCATA TAAGCCCTAA 600 

GAAATGGAGC TGGTAAATGT TGCTATGCGA ATCTAAAATC ATCAATAAAA ACCCAAAATA 660 

TAGAATTATT AAATATAATG ATGAATACTT AATGGTCGAT ATAATAAGCA CTTGGATTAG 720 

TTTATTTTTT CCTTTTATTA ATTGGTTCAT CCCaAAAGaA TACGTCAAAA TTAGTAGAGA 780 

AGAGTTTGAA AACTTAAATA TTGTTaAACC TGctAAAAAG A 821 



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGffGTT CTCTATTTAT ACAATACTTC 60 

GTATTGAATG GcTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG ACTGGCACTG 12 0 

CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC TTTGAAATAC 180 

TTTTAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA ATAAAATGAA 24 0 

TTTTCTGTGT TGGGTCCCTT CTTATAATTT AATAAATACC ACTAAACTAA ATTAACGAGG 300 

TGCCTTATGT ATAAAAATTA TAACATGCCC CAACTACACT ACCAATAGAA ACTTCTGTTA 3 60 

GAATCCCTCA AAATGATATT TCACGATATG TTAATGAAAT TGTTGAAACr ATACCTGATA 4 20 

GCGAATTCGA TGAATTCAGA CATCATCGTG GCGCAACATC CTATCATCCA AAAATGATGT 4 80 

T AAAAAT CAT CTTATATGCA TATACTCAAT CTGTTTAATT ATGTTCAAAG CATTAAGGTA 54 0 

ACAAGACAAT ATCTAAGATA T C AAAG AT AG AAATTTTTTG ACGTTGTTGC TGATTGTAAA 600 

CATAACCATC AATTTCATAA TTAATAGCAT CAATACGATA AAEGTTAAG CGTACTGAAT 660 

CTACAAAGCC ATT ATT AT AA AATTTAACTT CTACAGGTTG GGCATATTGT AGCGCCTCGT 72 0 

GTAGCCGAAT GTTTAGCTCA GCCAATTGAT CATCTGATAA TACAGGACGT GTAATTTTGT 7 80 

TTTGGTCGAT AATGTATTGT TGAATCGTTT CGAATTGTTC GGGTAATGTT G C AAAAG GAG 840 

CCCATTTAAT CATGCCTCTT CCCATAGGTA TATTGTTATC TAGTAATTCT CTTGGAACGT 900 

TACGATAATC AGTTTCTTCT TCATAACTTG TCATCCTTAA TTCACCCCAA TCTGATAATT 960 
ACATTATACG AACATGTGTT CTATTTTGCA ACAAAAATTT TGTGGaAGCA TAAACGCGTT 1020 

AATAATTAAT GCTCGTGtAA GTAAJAAAGA GGGATTAATT AAAATCGAAT AATGaCATAT 108 0 

CACaGCAAAT AGTTCTTTTA AAGTAGTTAA ATAGTTTTAG CTTTAAGGAA aTGATAAaTG 114 0 

ATTGTwAATT CTAGCTAAAA TTTAATAAAA TGAAAATAAG ACTAACATGG AGGGGTAAAA 1200 

GTAATGACAA ATGGATATAT TGGTTCTTAC ACTAAAAAGA ATGGTAAAGGGATTTATCGT 12 60 

TTTGAATTAA AC G AAAAT C A GTCACGTATT GATTTATTAG AAACAGGATT TGAATTAGAA 1320 

GCGTCTACAT ATTTGGTGCG TAATAATGAA GTTTTATATG GAATCAACAA AGAAGGAGAA 138 0 

CAATGCGGTG TTGCAAGTTT GAAAATTGAT GACAATGGTG AATTGCATTT ATTAAATAAA 14 4 0 

TGTTTGTCTT CAAAAGCTGG TACAGGTTGT TATGTATCGA TTTCAGAAGA TAAACGATAT 150 0 

TTATTTGAAG CGGTATATGG TGCTGGCATC ATACGTATGT AT GAAT T AAA TACGCACACA 15 60 

GGTGAAATTA TACGTCTAAT TCAAGAACTT GCACATGATT TTCCAACAGG T AC AC AT G AA 162 0 

AG AC AAG AT C ATCCACACGC ACATTATATT A2TCAAACTC CAGATGGTAA GTACGTTGCA 1680 

GTAACAGATT TAGGTGCTGA TCGTATCGTT AC T TAT AAA T TTGATGACAA CGGGTTTGAA 17 4 0 

TTTTATAAAG AATCTTTATT T AAAG AT AG T GATGGGACAA G AC AT AT T G A ATTTCATGAT 1800 

AATGGAAAAT TTGCTTATGT CGTACACGAA TTATCAAATA CTGTGAGTGT TGCAGAATT 18 60 

AATGACGGTA AATTTGAAGA GCTCGAGCGT CATTTAACAA TTCCTGAAAA CTTTGATGGA 1920 

GATACTAAAC TTGcAGCAGT GCGTTTATCT CATGaTCAAC AATTCTTATA TGTATCTAAT 198 0 

AGAGGGCATG ATAGCATTGC AATTTTTAAA GTTCTTGATA ATGGTCAACA CTTAGAACTA 204 0 

GTAACAaTTA CTGAAaGTGG TGGTCAATTC CCAAGAGATT TTAATATTGC CTCATCAGAT 2100 

GACCyTTTAG TTTgTGCTCA kGaGCaAGGA GATTCAGTTG TAACTGTTTT CGAAAGAAAT 2160 

AAAG AAAC AG GT AAAAT T AC GCTATGTGAT AACACTCGTG TAGCATCTGA AGGTGTATGT 222 0 

GTCATATTTT AATCTTTAAT TAATCATGAT AAAAAGAAAACCATGTTTCC AAAAAATTTG 22 8 0 

TGTATACCTT GAAATTTATT GnTTTCCAGn ACATCAATTA TGGGAAGCAT GGnTTATTTT 234 0 

TGT 2343 
(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4837 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

AAATTGCCAG TTGGTATCGC TTCTGGTGCA GTAGTCGAAG GTTTCTTCCA AGGTATCATT 60 
CCGATTGGCT ATATCGTTAT GATGGCAGTA TTGTTATACA AAATTACTGT TGAATCTGGA 120 

CAATTTTTAA CAATTCAAGA TAGTATTACA AATATTTCAC AAGACCAACG TATTCAAGTT 18 0 

TTACTTATTG GATTTGCATT CAACGCATTT TTAGAAGGTG CAGCAGGATT TGGTGTACCA 24 0 

ATTGCAATTT GTGCACTTTT ATTAACACAA TTAGGATTTA ATCCATTAAA AGCTGCGATG 300 

TTATGTTTAG TCGCAAATGC AGCGTCTGGT GCTTTTGGTG CGATTGGTAT CCCTGTAGGT 360 

GTTGTAGAAA CGTTGAAATT ACCTGGAGAT GTTTCAGTAT TAGGTGTTTC TCAATCAGCA 4 20 

ACTTTAACAT TGGCAATCAT AAATTTCATT ATTCCATTCT TETTATCTT TATTATTGAT 4 80 

GGTTTTAGAG GTGTTAAAGA AACATTACCA GCAATTTTAG TAGTTTCAAT CACTTATACA 54 0 

CTTACTCAAG GATTATTAAC TGTATTCAGT GGACCTGAAT TAGCAGATAT TATTCCACCG 600 

TTATTAACAA TGTTAGCATT AGCAGTATTT TCTAAAAAAT T C C AAC C AAA ACACATTTAT 6 60 

CGTGTTAATA AAGATGAAGA AATTGAACCT GCAAAAGCAC ATTCTGCAAA AGCAGTATTA 720 

CATGCATGGA GCCCATTCAT TGTATTAACA GTCATTGTAA TGATTTGGAG TGCGCCATTC 780 

TTTAAAAACT TATTCTTACC AAATGGTGCT TTATCATCAT TAGTATTTAA ATTCAACTTA 84 0 

CCTGGaACAA TCAGCGAAGT TACGCATAAA CCATTAGTAT TGACTTTAAA TATTATTGGA 900 

CAAACAGGTA CAGCTATTTT ATTAACTATT AT TAT T AC AA TTTTAATGTC TAAAAAGGTT 960 

AACTTTAAAG ATGCAGGTAG ATTATTCGGC GTTACATTTA AAGAGTTGTG GTTACCAGTT 102 0 

CTTACAATTT GTTTCATCTT AGCAATTTCT AAAAT C AC AA CTTATGGTGGTTTAAGTGCA 108 0 

GCAATGGGTC AAGGTATTGC TAAAGCAGGT AATGTCTTCC CAGTTCTATC ACCAATTTTA 114 0 

GGTTGGATAG GTGTGTTTAT GACAGGATCA GTTGTAAATA ACAACTCATT ATTTGCACCA 1200 

ATTCAAGCTT CTGTTGCACA ACAAATTGGA ACAAGTGGTT CACTTCTTGT ATCTGCTAAT 12 60 

ACAGTTGGTG GTGTAGCGGC AAAAT T GAT T TCACCACAAT CAAT TGCAAT TGCAACTGCA 1320 

GCAGTAAAAC AAGTTGGTAA GGAATCAGAA TTATTAAAAA TGACATTGAA ATACAGTGTA 138 0 

TGTTTACTAA TATTCATCTG TATTTGGACT TTCATCTTGT CATTATTATA AAAAAACGTA 14 4 0 

TTTCAAAATA TAAATATACA GAAGGTGAGA TSTTTTCTAA CATCTCATCT TTTTTTTATG 1500 

GATCATTAAT GAAAGAAGTT T GAC AT TATA ATAATGGTAG CGCTTTATGT T AAAAT GAAT 1560 

AGTGAGTAAT CAGCAATCAA ATTAAATTGG TTGATAGCTG TTAAGGTTTG TGGTTTTGTC 162 0 

TTTGTGCTAT CGCnCATAAA GTATATAATT AAAGTAGTTT CGTTATTATA AAATATAAT 168 0 

AT AC AT AG T A GATAGTAATA GAGCATCACC ATGGGAACCT ATTGAGACAC TTATTGATTT 17 4 0 

AAAGTGGTAT TAATATGTCG TATTTCTCGA ACGTTCCATT ATTCATTTTA AAAAGGGGGA 1800 

CTGTATTTGT TATGACAACA CAACATAGCA AAACAGATGT CATCTTAATT GGTGGCGGTA 18 60 
TTATGAGTGC aCATTAGGAA CATTACTTAA AGAATTATCA CCTGAGAAAA ATATTAAAGT 1920 

GTTTGAAAAA TTAGCACAAC CTGGCGAAGA GAGTTCAAAT GTATGGAATA ATGCCGGTAC 198 0 

AGGGCATTCA GCACTTTGCG AGTTGAACTA TACAAAAGAA GGTAAGGATG GCACAGTTGA 2 04 0 

TTGTAGTAAA GCAATTAAGA TAAATGAGCA GTACCAAAT TCAAAACAGT TTTGGGCATA 2100 

TTTAGTTAAA ACAGGACAAT TAGATAACCC AGATCGCTTT ATTCAAGCGG TGCCACACAT 2160 

GAGTTTTGTC ATTGGCGAAG ATAATGTAGC TTTTATAAAA AGTCGTGTTG CAACGTTAAA 222 0 

GAAAAGTATT TTATTCGAAA AAATGAAATT ATCGCAAGAT GAAGAAGAAA TGAAATCTTG 2280 

GGTACCGTTA ATGATTGAAG GTCGTAAGTC TGATGAACCA ATTGCTTTAA CTTATGATGA 234 0 

AACTGGTACa gATGTTAACT TTGGTGCGTT AACTGCAAAG TTATTTGATA ATTTAGAGCA 2 4 00 

ACGTGGTGTG GGAATTCAAT ATAAGCAGAA TGTATTAGAC AT CAAGAAAC AGAAATCTGG 2 4 60 

GGTATGGCTA GTTAAAGTTA AAGATTTAGA AACTAATGAA ACGACAACAT ATGAATCTGA 2520 

TTTTGTATTT ATTGGTGCTG GCGGTGCGAG TTTACCATTA CTCCAAAAGA CTGGGATTAA 2 58 0 

ACAATCAAAA CATATTGGTG GTTTCCCGGT AAGTGGATTA TTCCTGCGCT GTACAAATCA 2 64 0 

AGAAGTGATT GATCGTCATC ATGCTAAAGT GTACGGAAAA GCAGCfiTGG GTGCGCCACC 27 00 

AATGTCAGTG CCGCACTTAG ATACACGTTT TGTAGACGGC AAGCGTTCAT TGTTATTTGG 27 60 

TCCATTTGCA GGTTTCTCAC CTAAATTTTT AAAAACAGGT TCACATATGG AT T T AAT T AA 2 82 0 

ATCGGTTAAA CCAAATAATA TCGTGACGAT GTTATCTGCA GGTATCAAAG AAATGAGTCT 2 88 0 

TACGAAGTAT TTAGTGTCAC AATTGATGTT ATCTAATGAT GAGCGTATGG ATGATTTAAG 2 94 0 

AGTCTTTTTC CCAAATGCTA AAAAT GAAG A TTGGGAAGTG AT T AC AGCAG GGCAACGTGT 3000 

CCAAGTAATC AAGGATACTG AGGATTCTAA AGGTAACTTA CAATTTGGTA CTGAAGTTAT 30 60 

TACGTCAGAT GATGGCACAT TAGCTGGSTT ACTTGGTGCA TCACCTGGTG CGTCAACAGC 3120 

TGTAGATATT ATGTTTGATG T T T T AC AG AG ATGCTATCGT GAT G AAT T C A AAGGATGGGA 3180 

ACCAAAGATT AAAGAAATGG TGCCGTCATT TGGTTATCGC t T AACAGATC ATGAGGATTT 3240 

ATATCATAAA AT T AATGAAG AAGTAACTAA GTATTTACAA GTTAAATAAT AASGAAACG 3300 

GTAATGTCTT TTTTAATGTG ATAGACATTA CCGTTTTTTA GTGGTTAATA AAAATCATTT 3360 

TAATTGTTTC AGTTGCTTGT TAATAGTGTC TACGTAGTTC TTGTTTTTAA AG AAT T G AAT 3420 

TATCCAAATT AAT AC AT AAA CCACAATGAA GATAATTGTG AATATGATTA GATAATGCAC 34 80 

TGTTAGTGGA AACCAACCGG CAAGCATTGC TAAAGGCAAG AATCCGACAT ACGTTGTTAT 3540 

GAAATGCATT ATAGTTGCTT TAGTAATGCT CCAATCTGTG TATTTAAAGA TAAAATCTCC 3 600 

AAGGAAAAAG ACGACGCCTA TGAGTAACCA T AAAAT GAT A GAAATCAACA TTACGGTAGT 3 660 

TTCTGTGAAA TGCGTATAAT ACAATATGCC AATACTTGAT TGTGGGTTCA GTGGATAATA 37 20 
TTTGCCGTCT GCAAATAACA TACTAAAGAA CAGTGAAAGG GACAAACCAA TGATTAAGCT 3780 

AATAAATAAT GAGTTTTTCA AATTTTTCAT ATTGATAAGC GCTCCTTTAT AG AT T T T AAA 38 4 0 

TAACGTCTAG AAGAATAGGT GTAGTGTGCA TCTTTAAGAT ACATACGTAT AAGTCCATTT 3 90 0 

GGCTCTAATA ATAATTTTTC AATGTAATAC TTGTTGACGA TTTCTGATTT GGAAATGCGA 3960 

ATGAAATGTT GTGGTAACTG TTTTTCTAGT TCATAAAGTC GTAATTTTAG TTTGAATTTT 4 020 

TGATTGCTTG TAATGGCTGT TACATTTTTA TTAATAACTT GGAATGTTAT AACATCTGAT 4080 

ACATTAATGA TATGGATTTC TTTGTCTATG TATCCGACTA ATGTATGTGA TTTGTCTAAA 414 0 

TCATTGACTG CAT T AAT AAT ACTTTGAACG TTATCATTCA TTTTAGGTGC ATGTATATCA 4200 

ATATAAGATT CCGTCTCATT TGCATTGATA AATAAATTGA GTTTCATCAT AGGTTAATGC 4 2 60 

CTCCTTCAAA ATTATTAAAC CATAAATGAC CATCGATATA TTAAATTTT GTTGAATGGT 4320 

AGAAAT T AAA TGTTAAGTGG CTAGAAAGCG CTAATCAATA TAAAAGATAC CTCCTGAAAT 4 38 0 

AAAAACAGAA ATGTTTTTTC AGGAGGTAGA GATTAAAGTG AATTATTTGG CAGTGTAATA 4440 

GTAAAGGTGG TTACATACTC GTTACTTTGT GTGAATTGGA TTGTACCATG ATGCAATTCA 4 500 

ATGATGGATT TTGTAATTGC AAGACCTAAA CCATTGCTAT TATCATGTTT GCTCACTTTA 4 560 

TAAAAACGTT CAAATAAACG TGCTTCAGCT TGTGGACTAA TTGGTGAACC ATCATTACTT 4 620 

ATTGTGAAAA TGATATTGTT GTGACTATGT TGCAAAGCGA TGTCAATGGC ACCACCAACA 4 68 0 

TCTGTATACT TAATAGCATT TATTAATAAA TTACTCAATG CTTGATGTAA CAAACGTTGA 47 40 

TTTCCTAGGA AAT T GAT GAT TCTAGGTCAG CTAAnATGAT TAACGACTTT TCATCAGCAG 4 800 

CAnATTGTTC ATGTCGAATG ATATCnTTAA TGAGCTG 4837 
(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

ACAATTATTG GAT TAT TAT C AAGCAACGTT AATGGATGAC TTCCfiTTAC AACAGAAATG 60 

CCCATAGATT CTAAATCTtT TGCATGAGCA TCTTGTGATA AGTCTTTTCC AT CAT TG AC A 120 

GTTACATTCG CACCTAATTT ACT TAATAAT TTAGCTGCTT CATAACCACT TTTTGCCAAA 180 

CCGACAACTA AT AC AT T T T T ATTTTCTAAC CCTGTATAAT TAAGCATCTT AATGCACTCC 240 

AATCCATAAA CCGATTAAAC CTGAAATCAG ACCAACAGCC CAAAATACTG TAACTACTTT 300 
CCATTCGCTC CATCCTATCA ATTCAAAATG ATGATGAATC GGACTCATTT TAAATATACG 360 

CTTTCCAGTC AATTTAAAGC TAGCGACTTG TAACATAACA GATAATGTTT CAAT TACGAA 420 

TACTAAACCT ATAAAAATTA ATGATAATTC CTGATTAAGC AT GAT T G AAA TGGTAGCAAA 4 80 

TATACCACCT AAAGCTAAGC TACCTGTATC TCCCATAAAC ACTTTAGCAG GGTTAATGTT 54 0 

ATATGGTAAA AATCCTAAAA GTGCAAACAA CATAATGATA CAGAAAATAC CAATTGCCGT 60 0 

TTCTCCTAAC ACAAAGCTCA TGATGGCATA CATTGTAAAT CCGATAATTG AEGTCCAGT 6 60 

TGCTAATCCA TCTAAACCAT CTGTTAAATT TACTGCATTA GAAAAACCTA CTTGCCAAAA 720 

AACAATGAAA ATAACATATG CAAATGATAG TGGGATTGCT ACATTCGTAA ATGGAATATG 780 

TATGCTCGTA GAAAAATTCA C C AAAT G AAA CACATTACTT AAAACAAAGA ATATAATCGC 84 0 

AATACCAATT TGCGCCAAAA ACTTCTGTTT ACTTGTTAAA CCTTGGTTAT TCTTTTTAAC 90 0 

AACAATAATA TAATCATCTA TAAAACCAAT TAACCCAAAA CCAATCGTCA CAAATAATAA 960 

CAGTATGATT GGATTAGCTT GAT C T AC AAA TATAATAGCC ACCAAAGACG TTATCACAAT 102 0 

ACTTAATAGA AATGTTAGTC CACCCATCGT TGCETACCA GTCTTCTTCA TATGGCTTTG 108 0 

TGGACCTTCT TCTCGAATAC TTTGACCAAA TTTCATCCTT TTTAATGTAG GTATTAAAAC 114 0 

AGGTACCAAA AC AAAT G T AA TCACTAGCGC TAATAACGCA TATACAAAAA TCATAACTAT 120 0 

CTCCTCTTCT TAATCCAGAC TTTTTTAACC ACTAATATAT TAT CAAT T T T TCAATTAAAT 12 60 

AAACAAAGTT GTAATCAAAA TTTATAATTT TTCTTTTTTA CGGCATAAGA GGCCAGTATA 132 0 

AAAAGTTTGC CTATAACAAA CAAGTTAATC TGACCTCGTC TACCTTAAAA TTCTCTATCA 138 0 

ACACTTATTT AT AAAG AT T A AATGAAGATG TTGTTTTCTA TCACAGCATT ACTTTAGTAA 14 4 0 

AAACAAATAG TGACAATACA TCCTAATTTA ATGTAGCCAT TCTTGTTAGT CCGACTTATC 1500 

CTTGTCAGTT TTACTGTCAG ATTTCnTCTT ATCATCTGAA TTTGAATCAG AATTATTCGT 15 60 

CGAATTGCTG TCTACATTCT CTGGATGGAA AATTCTACGT 1600 
(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

ATTGCCTTTG TTTTAATTTT AAATCAAAAT mGCCTAEAA AG AT T T AAAT CAATTAATTT 60 

CTATAATATT ATCATTTTTA AAGCATATCA TTGTTTAGTT TTTTTATAAT TGGATAAATA 12 0 

CTAATAGTTA CTTTATAAAA CATTACATAG AGAAAGGTTA AGGAGTGCAC ATGTCGAAAA 18 0 
AGGATCACTC TTCTTCAAAA TACCTTAATT CTGTTAAGGA AGCGCAAGAG GAGTCAAAAA 240 

AGAAAAATAA AAGTAATCCC AAAATTGATG TTGATCGTAC AT AT AT T G AA CCTCAACAAT 30 0 

TCCAATCTAA GAAACCTAAA AAAGATGATC AGGTTTTCTT CTTATCAAGA TTAAATAAAC 3 60 

CTGCAAAATA TAAGAAAGAC TCTAATTTCT TATCATATCT CATCTATCGC ATAGGAAAAG 420 

ATGATGCCTC AGGACTAGCA GCACAAATGA CTTACCATTT CGTACTTGCT ATGTTCCCTA 480 

TGTTGCTTTT CCTATTAACA TTATTACCAT TTTTCAATAT TAAGCAGAGT CAAATTACTA 54 0 

ATATGTTAAG CAATGCACCC GCTGAAACAT CTACTCTAAT TAAGAGTGTA ATTGGTGATA 60 0 

TAACTCAAAA CTCCAGTGGT GGCTTATTAT CTATCGGTTT GATTTAGCA ATTTGGTCAG 660 

CTTCAAATGG AATGACTGCA ATTATGAATT CTTTCAATGT TGCTTACGAT GTAGAAGATA 720 

GCCGTAATGG AATCGTATTA AAACTACTAA GTGTTGTCTT CACTGTAGTT ATGGGCGTTG 780 

TGTTTGTAGT TGCTCTAGCA TTACCAACGC TTGGTTCTGT AATTAGTCAT TTCCTATTCG 84 0 

GTCCACTTGG aTTTGACGAA CAAGTGAAAT GGATTTTTAA CCTTATTAGA ATTGTGTTAC 900 

CAATCATTAT TATATTTATC ATATTTATCG TGTTATATTC GGTTGCACCT AACGTTAAAA 960 

CGAAGCTTAA GTCAGTATTA CCAGGTGCAG TATTTACTTC AATTATTTGG TTAGCTGGTT 1020 

CATTTGGTTT TGGTTGGTAT ATTTC^ATT TTGGTAACTA TTCTAAAACA TATGGCAGTA 1080 

TCGCGGGTAT CATCATTTTG TTACTATGGT TATATATCAC AAGTTTTATT ATAATTGTCG 114 0 

GnGCTGAAAT CAATGCAATC ATTCATCAGC GTAGTGTAAT TAAAGG 1186 
(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

TCTTGAGCCA TCTTTTGAGC TAACTGACTA GAT T GAT AC C CAAAAACCAT AG T T AC C AAC 60 

ATAAACTTTA ATTTTACCGA AGTCTAAATC AGCGATATGA GTACATACAT TATTTAAGAA 12 0 

ATGACGGTCA TGCGATACTA CGATAACAGT ATTATCAAAG TTAATTAAGA AATCTTCTAA 180 

CCAACTGATT GCTGGAATAT CGAGACCGTT AGTAGGCTCA TCCAGTAATA GTACGTCTGG 24 0 

TTCACCGAAT AAACTTTGCG CTAATAATAC TTTAATTTTT TGGTTGTTTT CTAATTCAGC 300 

CATTTTTTTA TCGTGTAAAG TTGGATCGAT ACCTAAACCA GAT AAAAGG T TAGCAGCATC 3 60 

AGCTTCAGCA TTCCAACCAT TCATTTCTGC AAATTCACCT TCAAGTTCAG CAGCACGGAT 4 20 
ACCATCTTCA TCACTGAAAT CTGGCTTCiT ATAGATTTCA TCTTTTTCTT TCATAACCTC 480 

AT AAAG AC G T TCGTGACCTT TAATTACAAC ATCAAGCACG CGTTCATCTT CATAAGCATA 54 0 

GTGGTCCTGT TTTAAAACAG CTAGACGTTC ATTTTTCCCT AATGAAACAT GTCCTGTTTG 600 

AGAATCTAAT TCACCAGATA ATATTTTTAA GAATGTTGAT TTACCTGCAC CATOGCACC 660 

AATTAATCCA TAACAATTAC CTTCTGTAAA TTTAATATTT ACATCTTCAA ATAGTTTACG 7 20 

ATCTCCAAAA CGTAAACTCA CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT 7 80 

TCGATATTCT AACGGAAGAA TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT 840 

TGTAATGATA GAATGACTCA CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA 900 

ATAACATGGC AT T AGACAAA GATATAGTAG GTTCTATAGA ATTCCTTGAA GTAGTAGGGT 9 60 

TACAAGGTTC AACTTACCTT TTAAAAGGAC CAAACGGTGA AAACGTAAAG TTAAACCAAT 1020 

CAGAAATGAA CGATGATGAT GAATTAGAAG TAGGTCAAGA ATATAGTTTC TTCATTTATC 1080 

CAAACCGTTC AGGTGAATTA TTTGCAACTC AAAATATGCC TGATATTACG AAAGATAAAT 114 0 

ATGACTTTGC TAAAGTACTT AAAACGGATC GCGATGGGGC ACGTATAGAT GTTGGATTAC 12 00 

CCCGTGAAGT GTTAGTACCA TGGGAAGATT TACCAAAAGT GAAATCACTA TGGCCACAAC 12 60 

CTGGTGATTA TTTGCTAGTT ACATTACGAA TTGACCGTGA GAATCATATG TATGGACGTT 1320 

TAGCGAGTGA ATCTGTTGTA GAAAATATGT TTACACCTGT ACACGACGAT AATTTAAAAA 138 0 

ACGAaGTCAT TGAAGCCAAA CCTTACCGCG TATTACGAAT TGGTAGCTTT TTATTAAGCG 14 4 0 

AATCAGGTTA CAAAATTTTC GTACATGAAT CAGAACGTAA AGCTGAACCA AGATTAGGTG 1500 

AATCTGTTCA AGTTAGAATT ATCGGGCATA ATGATAAAGG TGAGTTAAAT GGTTCATTTT 15 60 

TACCACTTGC ACATGAACGT TTAGACGATG ACGGCCAAGT CATCTTTGAT TTACTAGTTG 162 0 

AATATGATGG TGAATTACCA TTCTGGGACA AATCAAGCCC TGAGCGATT AAAGAAGTAT 168 0 

TCAATATGAG TAAAGGTTCA TTCAAACGTG CAATCGGTCA CTTATATAAA CAGAAGATTA 17 4 0 

TTAATATAGA AACAGGTAAA ATCGCTTTAA CTAAAAAAGG TTGGAGTCGA ATGGACTCAA 18 00 

AAGAATAATC ATTTTTACAC GTGTCGTAGG ATGCGTGTTT TTTTTATTCA ATATTAAATC 18 60 

GGACAGATGA AGTAGTTTTT TAAACATTCC TTTCAAAGTA AAAAATTAAA TAATTCAAAC 1920 

GAATAGGCTG GGaCATTAAG TTCTTAGGCA ATGTAAAAAA GCTGATTTCT ATTAATTATT 1980 

TGATGGAAAT CAGCTTTTTT GATATGTATT TTATAATGTA CAGCTCGTTG AGCTGCTATT 204 0 

TTCCTTATAT TAAGTGCCAT TAATATAAAA CCTAGCTCTC GTTTAACTTT ATTTATTCCT 2100 

CGAACTGACA TTCGAGTGAA aCCCAAAATA GCCTTCATAA ATCCAAAAAC AGGCTCTACA 2160 

TAAATTTTTC TATGACTATA GATTTTTTTC GTTTCTGGTT C AG AAAG C T T TTGaTTAATT 2220 

TGGGCTTTAA TGTATTTCAA AGTAAAATTA CATGTTAATA CGTAGTATTAATGGCGAGAC 2280 
TCCTGAGGGA GCAGTGCCAG TCGAAGACAG GGGCCCCAAC ACAGAAGcTG ACATATAGTC 2340 

AGCTTACAAC AATGTGCCGG TTGGGGTGGC TGAGACGGCA CCCTAGGAAG GGACCCGTCA 2400 

TCAAAAATTC TATTTATAGA ATTTTACAGT AATGTGACAG ACGGGCAAAG CGAAgCCATT 24 60 

CAATACGAAG TATTGTATAA ATAGAGAACA GCAGTAAGAT ATTTTCTAAT TGAAAATTAT 2520 

CTTACTGCTG TTTTTTTAGG GATTTATGTC CCAGCCATTT TTGTATTCAT ATTTAAATTT 2580 

CGATAATTTT TCAGGAAGCA TTTTAATTTT ACTAATGAAG CAATATTTTT TAGATTAACA 2 64 0 

AAAATTAATA TTTACATTTT CTTAACAATT TTTTATGTAA CATTTACAGT TTCTAAAAAT 27 00 

GAGGTTAATA ATTCAAGGTT AAGATAAAGA TGTAATCAAT ACAAATACTA TTTGTTGTTC 27 60 

ATACAGGGAG GATATTTCAA TGAAAAAATG GCAATTTGTT GGTACTACAG CTTTAGGTGC 2 82 0 

AACACTATTA TTAGGTGCTT GTGGTGGCGG TAATGGTGGC AGTGGTAATA GTGATTTffiA 288 0 

AGGGGAAGCT AAAGGTGATG GCTCATCAAC AGTAGCACCA ATTGTGGAGA AATTAAATGA 2 94 0 

AAAATGGGCT CAAGATCACT CGGATGCTAA AATCTCAGCA GGACAAGCTG GTACAGGTGC 3000 

TGGTTTCCAA AAATTCATTG CAGGAGATAT CGACTTCGCT GATGCTTCTA GACCAATTAA 30 60 

AGATGAAGAG AAGCAAAAAT TACAAGATAA G AAT AT C AAA TACAAAGAAT TCAAAATTGC 3120 

GCAAGATGGT GTAACGGTTG CTGTAAATAA AGAAAATGAT TTTGTAGATG AATTAGACAA 3180 

ACAGCAATTA AAAGCAATTT ATTCTGGAAA AG C T AAAAC A TGGAAAGATG TTAATAGTAA 32 4 0 

ATGGCCAGAT AAAAAAATAA ATGCTGTATC ACCAAACTCAAGTCATGGTA CTTATGACTT 3300 

CTTTGAAAAT GAAGTAATGA ATAAAGAAGA TATTAAAGCA GAAAAAAATG CTGATACAAA 3360 

TGCTATCGTT TCTTCTGTAA CGAAAAACAA AGAGGGAATC GGATACTTTG GATATAACTT 3420 

CTACGTACAA AATAAAGATA AATTAAAAGA AGTTAAAATC AAAGATGAAA ATGGTAAAGC 3480 

AACAGAGCCT ACGAAAAAAA CAATTcAAGA TAACTCTTAT GCATTAAGTA GACCATTATT 354 0 

CATTTATGTA AATGAAAAAG CATTGAAAGA TAATAAAGTA AT G T C AG AAT TTATCAAATT 3 60 0 

CGTCTTAGAA GATAAAGGTA AAGCAGCTGA AGAAGCTGGA TATGTAGCAG CACCAGAGAA 3 660 

AACATACAAA T C AC AAT TAG ATGATTTAAA AGCATTTATT G AT AAAAAT C AAAAAT C AG A 3720 

C G AC AAG AAA TCTGATGATA AAAAGTCTGA AGACAAAAAA TAATAAGACG C AAT T T C AAA 37 8 0 

TGTGTCTTGA AACATGATTT TGATGGTGAA TCATTATTTA G AG T AC AAAG CTTGATTTAT 3840 

CGAGACGCTG ATTTTGACAT TCAGTTAGTC TAcAAGCTTA TCAACTffiAA ATAGTGGTTC 3900 

AT CAT TAT TT TACAAATCTA ATTATTTTGG GAGTAATAGA AAGAGGTTTG AT TAT G AC T T 3960 

CATCTACTAA TGTTAAAGCT TTAATCGAAA AAAATAATAA TAAAAAAGGA AAGCATAATG 4 02 0 

ACAAAATTAT ACCAGTTATT TTAGCCGCAA TTTCAGCGAT TTCCATTTTA ACAACACTAG 4 08 0 
GTATATTAAT CACATTGCTT TTAGAAACCA TCACTTTTTT CACCAGAATT CCAATAACTG 4140 

AATTTCTATT TTCTACTACT TGGAATCCTA CCGGTTCAGA CCCTAAGTTT GGTATCTGGG 4200 

CATTGATAAT AGGGACTTTA AAAATCACAG TTATTGCGAC TATATTTGCA GTTCCAGTCG 4 2 60 

GGCTTGGTGC AGCAATTTAT TTAAGTGAff ATGCAAGTGA CCGTGCACGA CGCATAATTA 4 320 

AACCGATATT AGAAATTTTA GCAGGAATAC CAACAATTGT GTTTGGTTTC TTTGCATTAA 4 38 0 

CCTTTGTTAC ACCAGTATTA AGATCTTTCA TACCAGGTCT TGGAGAGTTT AATGCTATAA 4 44 0 

GTCCCGGCTT AGTTGTCGGT ATTATGATTG TCCCTCTCAT CACAAGTTTG AGTSGGaTG 4 500 

CAATGGCATC TGTACCAAAT AAAATTCGAG AAGGTGCCTA TGGACTTGGA GCAACTAAAT 4 560 

TAGAAGTAGC AACTAAAGTC GTACTTCCCG CAGCAACATC AGGTATTGTA GCTTCAATCG 4 62 0 

TTCTCGCGAT TTCAAGAGCA ATTGGAGAAA CGATGATTGT ATCATTAGCG GCAGGTAGTT 4 68 0 

CGCCAACAGC TTCATTAAGT TTAACAAGTT CGATTCAAAC AATGACTGGA TATATTGTTG 4 74 0 

AGATAGCGAC AGGTGATGCA ACATTTGGAT CAAATATTTA TTACAGTATT TATGCTGTAG 4 8 00 

GGTTCACACT ATTTATCTTT ACCTTAATCA TGAATTTACT TTCTCAGTGG ATTTCTAAGC 4 8 60 

GTTTTAGGGA GGAGTATTAA TATGGAAACG ACAGAAATA AT AG AC AAT C ACTCGTCGAT 4 920 

CAACAACTTG TCCAAAAACA TTTAT CATCC AGAACGGTTA AAAATAAAGT GTTCAAACTC 4 98 0 

ATATTTTTAG CAT G T AC AT T ATTAGGACTT GTCGTACTTA TTGCGTTGTT AACTCAAACA 504 0 

TTGATTAAAG GGGTAAGTCA TTTAAATTTA CAGTTTTTCA CTAATTTTTC TTCTTCAACA 5100 

CCATCTATGG cTGGCGTTAA AGGCGCGTTA ATCGGTTCAC TTTGGTTAAT GTTAAGTATC 5160 

ATTCCATTAT CAAT CATCCT AGGAATAGGT ACAGCTATAT ACTTAGAAGA ATATGCGAAA 522 0 

AACAACAAAT TTACTCAGTT TGTTAAAATC AGTATTTCCA ATTTAGCTGG TGTACCATCA 528 0 

GTTGTATTTG GGTTATTAGG TTATACTTTG TTCGTTGGTG GTGCAGGGAT TGAAGCCTTG 534 0 

AAAATGGGTA ACAGTATATT GGCAGCAGCG CTAACAATGA CCTTACTGAT ATTACCAATT 54 0 0 

ATTATTGTTT CAAGTCAGGA AGCAATTAGA GCTGTACCTA ACTCAGTACG CGAACTTcTT 54 60 

ACGGCTTAGG TGCTAATAAA TGGCAAACGA TAAGACGTGT TGCTTACCA GCAGCGTTAC 552 0 

CTGGTATTTT AACTGGATTC ATTTTGTCTC TTTCAAGAGC ACTGGGAGAA ACAGCGCCAC 558 0 

TTGTGCTAAT CGGTATACCG ACTATATTAT TGGCAACACC TAGAAGTATA TTGGATCAAT 5 64 0 

TTTCAGCATT ACCTATCCAA ATATTTACTT GGGCGAAAAT GCCTCAAGAA GAATTCCAGA 5700 

ATGTTGCATC GGCAGGCATT ATCGTTTTAC TAGTTATCTT AATCTTAATG AATGGCGTTG 57 60 

CGATTATTTT ACGTAACAAA TTTAGTAAAA AATTCTAATT TAAACAATCA ATCTCATTTA 5820 

TCTATTAAAA AGGGAGTTTT AAATATGGCG CAAACACTTG CACAAACTAA ACAAATATCT 58 8 0 

CAAAGTCATA CGTTTGATGT CTCACAAAGT CATCATAAAA C AC C AG AT GA TACAAACTCA 5940 
CATTCTGTTA TATATTCAAC ACAAAATTTA GACTTATGGT ATGGCGAAAA TCATGCATTA 6000 

CAAAATATTA ATTTAGATAT TTATGAAAAC CAAATTACTG CCATTATAGG TCCATCTGGT 60 60 

TGTGGCAAAT CAACATATAT TAAAACTTTG AATCGAATGG TTGAGTTAGTACCTTCTGTG 612 0 

AAAACAGCTG GTAAAATATT ATATCGAGAT CAAGACATTT TTGATCAAAA ATATTCTAAA 6180 

GAACAATTAC GTACAAATGT GGGCATGGTC TTTCAACAAC CTAATCCATT TCCAAAATCA 62 4 0 

ATATACGATA ATATTACTTA CGGTCCAAAG ATTCACGGTA TTAAAAATAA AAAAGTTCTT 6300 

GATGAAATCG TTGAGAAATC ATTACGTGGC GCTGCAATTT GGGATGAATT AAAGGATAGG 6 3 60 

TtGCACACAA ATGCATATAG TTTATCCGGT GGGCAACAAC AACGTGTTTG TATCGCGCGT 64 2 0 

TGTTTAGCAA TTGAACCTGA AGTCATTTTA ATGGATGAAC CGACATCAGC ATTAGATCCA 64 8 0 

ATCTCAACAT T AAGAG TAG A AGAGTTGGTT CmGAACTAA AAGAAAAGTA TACAATTATT 65 4 0 

ATGGTtACAC ATAATATGCA ACAAGCAGCT CGTGTATCAG ATAAAACTGC ATTTTTCTTA 6600 

AATGGTTATG TCAATGAATA T GAT GAT ACT GATAAAATTT TCTCTAACCC ATCAAACAAG 6 660 

AAAACAGAAG ATTATATTTC AGGAAGGTTT GGTTGATATA TAATGGCAAT AATTAGASA 67 2 0 

CGATATCAGG AGCAACTTGA TGATTTAATA AAAGAATTAC GTCGGTTAGG TGCaAATGTC 67 80 

TATGTGAGTA TTGaAAATGG TATAAAAtCA TTAAGTATTG aCGATAGAGG cTTTGCACGA 68 4 0 

CAAACAGTTA AAAACGATAA AC AT AT C AAT CAATTAAATT AT GAT AT T AA TGAGCGAGTT 6 900 

ATCATGTTAA TTACAAAGCA ACAGCCCATT GCGAGTGATT TGCGTATGAT GATTTCTTCA 6 9 60 

TTAAAAATCG CCTCCGATTT AGAAAGAATA GGAGATAATG CCTCGAGTAT TGCCAATATT 7 020 

CGATTGCGTA CAAAGATTAC AGATGATTAT GTGTTAACCC GTTTAAAGAC AATGGGTAAA 7 080 

TTAGCTATGT TAATGTTAAA GGACTTAGAT CAAGCATTTAAAAAGAAAGA TACCGTATTA 714 0 

ATAAGAGAAA TAATTGAGCG TGATGAAGAT ATCGATGACT TATATAGTCA TATTATTAAC 7 200 

GCAACGTATC TTATTGATAA CGtCCATTTG TCGC TGCACA AGCTCATTTA GCAGCAAGAC 7 2 60 

ATTTAGAACG TATTGGTGAT CATATTATTA ACATCGCTGA AAGTGTTTAT TTTTATTTAA 7320

CAGGTACACA TTACGAACAA TAACTTAAAG T TAT T AC TAT AAAATCCCTT AC G AT AAAT A 7380 

TATATTTCTA TTATTCATAA ACCCTCAAAA AAACCAAGAT TCTCACAATT AG T AAT G T GA 74 40 

AAATCTTGGT TTATATTGTT CTACTATAAA TTGTCTCGCA TCTTAGTTAT TTGCTTGCTC 7 500 

AATTTCATCT GTTAATTTTT (AACTTCATC G AC T AAAT C A G AAAT AT AT T GAATTGTAGA 7 5 60 

TTTAAGTGGC TGTTCTGTAG TAATGTCTAC ACCTGCAATG TTTGCAAGTT CGACAGGTGA 7 62 0 

TACACTACCA CCTTTTTTCA ATGTTTCTAA CCAAGCATCA ACAGCTGGTT GGCCTTCATT 7 68 0 

TTTAATCTTT TGAGAAACGA CAGTTCCGAT TGTTAAGCCA GCAGAAffiCG TATACGAATA 77 4 0 
TAATCCCATA TAGTAATGAG GTTGACGCAT CCATGTTAAT TCAGCACCCT CAGTCATGTC 7800

TACTGCATCT CCAAAAAATT GTTTATAAAC ATTTAGCATT ATTTCATTTA ATGTnCGGCG 7 8 60 

CATTTAAAGA TT 7872
(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

TTTTTTCTTT TCTTCATTTG AAAATTGATC ATTCAGCAAT ATAAGCGTAT TTGTTAATGA 60 

TTTAGGTGTT CCAATTTCAT AATCCCACCA ATTTAAGTTG GTATTCTTGC CAGTTGTTTT 12 0 

AGTAAAATTC TCACTTAATT CTTTTACTTT TTTATCTGGT TCTTTTCCAT ATGCATTTTT 180

ATGCAGCCAC TCAAGGGCAT CTTTCACTTT CTTCTTATTT TCGTCAGTAT TTAAAGTGGT 24 0 

TTTAGGATTC CTCATCGCTT CTGCGATTTT CTCAATATTA CGATAGGTAC GAGTCATATG 300 

AGAAGAATTA GTTTCAAGGG TTTCCGCTCC TGACCACAAG TATTTCCTAC CACTTTCAGT 360 

TTTCATTTCC TTGAGTAAAT TCGTCGCCTC TTTCTCTGTA GCATCAAACTTCTTCTTCAT 4 20 

ATCTGGATTA TTCTCATCAT ACTTATCATA ACCATAGTTA ACGTCCAGCC ATGTGTTCCT 4 80 

CAATTTTTCA TAATCTGGCG TTTGAACATT CGTATCAGCC ACAGCGATTT GATGTTTATC 54 0 

AACACTTCTG AATTCACCAC CATTCAAAGT AAT C AC AC C A GCCATTAATA ACGTAATGGT 600 

GGATAATTTT TGCCATTTCT TTATTCTATA TGTCATTGaC ATGTCTCCTT TTTGTGTTGC 660 

GCGTGCGCAA TGAATATTAT GATTAAATAA TGATTCAATT TTTCAAAATT CGTTAACGTA 72 0 

TACAAATGAC TGTCTACTGT CAAACAATCC AC AAAGAAT G TTGATGtCAT AT a AAC AAT C 78 0 



GATCACCCAA ATTTTCCG 798

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:
TACAGGTTTT ACTATAATGG ATGGTATTTT GGCTAAACGA CATTGGTTTA GTCTTCTTTT 60 
TTTnACTTCC TAnATTTACA ATGGTATAAA TAATAATGCT AT AT T T AG AA T GAT GAG TAT 120 
ACTTACTGAA ACTAAATTAA AAGTGTCTGG TTCTTTACTA AAGATAGCTG CTACCTTGC 180

TATCAAACTT GGCAATGAAA TTACCAACAC GATGAGCGCA TAAATTTTCT GTTTAATTAA 24 0 

AATACAAGTT CCAATGAGCG CAATTAAAAG TACTAACCCA ACG AT GAAAC TCTGTTTGTC 300 

ACT T AAC T C A AAGAAAC TAT AGATAGGATA TTTTTTAATA ATCAAGCCAC CTAAAATCAT 3 60 

CCATAAAAAT ACGATAATTC CATAAGTCAC ATTTATAACA TACGTTATTT TTTGGTCACC 4 20 

AAATCGGACT AATGTATTTC GTAGAATCAG CATACCAATG ACAACACCTA AAAT AAC GAT 4 80 

ACTAGCTATA TAAAGTAAAA ATGCAATTGT CACATCAAAT G T AC C C AAAT CTAAAAACCT 54 0 

AGGAATTAyA AyGACTGCTA AAATAAAAGC GAAG^CAAA GTAATATAkT T AT AC AAAC C 600 

GGTAGTAAGA CTTATCTCAG GTGATAATTG ATCAGCCATT GACTTAATCG GTGTATTAAT 660 

AATTGAACTT GTATCTTCGT TATTTTTTTC AGCCATAGTT AAATGATCTT CGAGCTCTTC 720 

CAATAACTCT TCTACTTCTG CTTCAGTCTT ACCTCTAAAT AACAATTCAA CACGTAATTT 780 

TTCTAAAAAA TCTTGAGATT GTTTACTTAA CATCGTTTTC CCCTCCAAAC AAGTTAATCA 84 0 

TCCCTTTATT CAAAACTTGC CATTTCGATT TAAATACTTT TAGTTCCTTT AAACCTGAAT 900 

CGGTAATCGT ATAGTATTTC CGCCTCGGGC CGCCATTACT AGATTTTTTT ATTGTCGTAT 960 

CAACGTATCC TTTTTTGTTT AAACGCAT T A AAACTGGATA AATACTACCC TCACTTATCT 102 0 

CTGGAAACTC TTGATTCTTA AGTTTCGTCA TAATTTCATA TCCATACGTT TCGCCTTGGG 1080 

CAATGAGACC TAATATCGCC CCATCTAAGA GACCTTTCAT AATCTGATCT G AC AC T G AC A 114 0 

TTTTAATCAC CTACTATCTT ACATAATAAG ATAGTACATT G5AACTTTT CGTCAACTAT 1200 

CTTTTATTGT AAGGTAGTTG TTGTACACAT TCCTTAAATG ACTAACAACT TTGTTAATAG 12 60 

GGTAATACTT ACGGAAGTAT ATTTTATTTA TGGGGGAGGA ATTAATAATG AC T AC AAAAA 1320 

CAGTATTTGA TGTCATTGAT ATGGGGTTAG GATATTTAGT AAATGTGTAT GATGCTTGGA 138 0 

AAGTTGAAAA GGTACTTGAT GATTATCATA AGCCTTTTTC TAATACCATT CATTGGCAAT 14 4 0 

TTGGtCATGT ATTAACAATT TTTGAATCGG CCTTAGCTGT TGCTGGTAAA GAGAATATTG 1500 

ATTTAAATAT CTATAGACCT TTATTCGGAA ATGGTTCGTC T C C AG AT G AA TGGAAGGATG 15 60 

AAGTACCGAG TATTGAAAGG ATTTTAGAAG GTCTCCAAAC TTTACCTGAA CGTGCACGAA 1620 

ATCTAACTGA AGATGATTTA GCAATTGAAT TGAAACAGCC AATTGTCGGT TGTAATAACT 1680 

TAGAAGAGTT ATTAGTATTA AATGCCATTC ACATCCCACT TCATGCTGGT AAAATTGAAG 17 4 0 

AGATGTCTCG TATATTAAAA AATTTAAAAT AAATATGTGC TTATTAACCGTTAACAACAC 18 00 

GTTAACGGgT TTTTTATTTG TTTAAAAGGT CACTTTTTTG AATTTAATAA ACACCATCTA 18 60 

TACCAGTTCT TCACCGATTC TCGAAAAATA ATTATATTAA TGATTTCGTT AATTTAATTT 192 0 
TATATTTAAT TATTACTGTA CATCTTTTGT AGTTAGCTTT ATTCTTAAAT TGAAATATGT 1980

ATAAAACATA GAAAGAAGGA CGTTTTAACA TGGAAAACAC GGTTAAATAT CGTAAGTTTA 204 0 

TACTCCCTAT CGTTGTAGGT CTCCTTATTT GGGCACTTAC ACCTTTTAAA CCGGATGCTG 2100 

TGGATCCAAC AGCATGGTAT ATGTTCGCAA TATTCGTCGC GACAATCATT GCTTGTATTA 2160 

CACAACCGAT GCCAATTGGG GCCGTCTCTA TIATTGGATT T AC AAT CAT G GTACTCGTTG 222 0 

GCATTGTTGA CAT G AAAAC G GCTGTCGCTG GTTTTGGTAA TAATAGCATT TGGTTAATTG 228 0 

CTATGGCATT TTTCATTTCG AGAGGATTTG TGAAAACAGG TCTTGGTAGA CGTATCGCAC 2 34 0 

TTCATTTCGT CAAATTATTT GGTAAAAAAA CATTAGGATT AGCATATTCT ATCGTQETG 24 00 

TAGATTTAAT TCTAGCGCCT GCTACACCAA GTAATACCGC GCGTGCTGGT GGAATCATGT 2 4 60 

TCCCAATTAT CAAATCACTT T C T G AAT CAT TTGGTTCGAA AC C G AAAG AC GGATCAGCAC 2 52 0 

GCAAAATGGG TGCATTTCTT GTTTTCACAG AATTCCAAGG TAATTTAATT ACTGCGGCTA 2580 

TGTTTTTAAC TGCAATGGCC GGTAACCCCC TTGCACAAAA TTTAGCATCT AGCACATCTA 2 64 0 

ATGTTCACAT TACATGGATG AATTGGTTTC TAGCTGCTTT AGTTCCTGGA CTTGTTTCCT 2700 

TAATTGTTGT ACCTTTTATT ATT T AT AAAA TTTATCCACC AACTGTTAAA GAAACACCAA 27 60 

ATGCTAAGAG TTGGGCTGAA AATGAATTAG CGACTATGG TAAAATCGCT TTAGCTGAAA 2 82 0 

AATTTATGAT TGGTATTTTT GTCGTTGCGT TAACACTATG GATTGTCGGA AGTTTCATTC 2880 

ATATTGATGC AACTTTAACG GCCTTTATTG CGCTAgcATT gTTATTATTG ACAGGCGTCT 2 94 0 

TAACATGGCA AGACATTTTA AAC GAAAC AG GTGCTTGGAA C AC AT TAG T A TGGTTCTCAG 3000 

TATTAGTGTT AATGGCCGAC CAATTAAACA AGCTTGGATT TATTCCTTGG TTAAGTAAAT 30 60 

CCATTGCTAC AAGTCTTGGT GGCTTAAGCT GGCCTATAGT CCTGGTCATT TTAATATTGT 3120 

TCTACTTCTA TTCACATTAC TTATTTGCAA GTTCTACAGC ACATATCAGT GCGATGTATG 318 0 

CAGCATTACT AGgCGTTGCC ATCGCAGCCG GTGCACCACC AT TAT T C AG T GCATTAATGT 3240 

TAGGTTTCTT CGGTAACCTA TTAGCTTCAA CAACACACTA TAGTAGTGGT CCAGCGCCGA 330 0 

TTCTATTCTC TTCAGGTTAC GTGACTCAAA AACGTTGGTG G AC AAT GAAC TTAATATTAG 336 0 

GTTTCGTCTA CTTTATTATC TGGATTGGTT TAGGATCACT TTGGACSAAA GTAATTGGTA 3420 

TATTTTAAAA TATTTAAATT AGCGCTCGAA TCTCATTGAT TTGGGCGCTT TTTAATTTGT 34 8 0 

ATTTAAAATC AACCTTTGCT AAATCAAGAC TCCCTTTTTA AAATACGTTT ATCCTTTAAA 354 0 

TCATTGCGTG CTTCACTGAA AATTTGTATA AAGATTTAAG TCATTACGTA ACATCACATA 360 0 

AAATACATTT C TAT AC T AT T CCGCTTCATT GATTAACATT ACGTATGCCC TCATAAATCA 3 6 60 

T CAT AC AAAA AACACCTTCG TTTAAATTCA TTTTAATTGC GAATTCAACG AAAGTGCCTT 37 2 0 

ATTTCATATT TAATGTTTCA AATTTATACG TCTGTCACTG TTACTGCACA CATACCTCAG 37 8 0 
TCTCACCTCA TCATGACACA AAGGATATTG AAAACACGTA TAATTATCTT TCCTCGTATT 3840

TTATAGGGTT TTTGCGACCG GATGTTTCTT CAATTTAATG TATTGAGAAA GACTATATAA 3 90 0 

CACAATACCT GTCCAAATAA ATATAAACGT AATTAATTGA TCTATACTAA AAGGCTCTTT 3960 

GAAAACAAAT ATGCCGAGTA CAAACATTAT TGTTGGTCCA AC G TAT T G AA TAATCCTAT 4 020 

TAGCGAAAGT GGAATACGTT TTGCCCCGGC TGAGAATAGG ATTAGTGGTA TTGCCGTAAT 4 08 0 

AGCACCAGAA AATAACAACC AAAATGATGA CATGTTCAAT CCAAATGACA TCTGATGTTG 414 0 

CTGCCATAAA TAAATAACGT ATATTAGTCC AGCAGGTGCG GTAACAATAC ATTCAATCGT 4 200 

AATACTGCTG ATGGCATCAA TAT GT ACT AC TTTTTTCAAT AATCCGTATG TACCAAAGGA 4 2 60 

TAACGCTAAT ATAATAGAGA CGATTGGGAA TTCTCCAATC TTGAGCGTCA TATATAATAC 4 32 0 

ACCGATGAAT GCGAATAAAA TGGCTAGCCA TTCAAATTTA TTGAATCTTT CTTTTAAAAA 4 38 0 

GATAAGTGCG AGCAAAATGC TAACAAGTGG ATTTJTATAA TAACCTAAAC TTGTTTGTAG 444 0 

GACGTGACCG TTCGTTACAG CCCAAATAAA TGTACCCCAA TTTAATGTAA TGACATAGCC 4 500 

TGCTACGACA ATCGCTAATA GCTGAATGGG CTTGCCTAAC AATTGATTCA TATCTCGTTG 4 5 60 

AAATGCATTG CGTTGTTTTT GTCCAACCGC GAG TAT GAAA AT CAT G AAT A TTGCTGAAAA 4 62 0 

TATAATACGA AAGGCTAAAA TTTCAAATGC GCCTATTGCA TCAACGAACT GCCAATATAT 4 680 

AGGTAGTATT CCCCACAGAA TGTATGCACT GAGTGCTAAA AATATGCCTT TTTTATACTC 4 74 0 

TGAATTCACC TTCAAACCTC CTTACTTTCC TAATTTTTAA TTTACTGCAT ACGCTCACTT 4 800 

GGTTATGCTA ATATAACGAT TTTACTAATA ATATTTCGAT AAAGATATCA TTTTGTTTAT 4 8 60 

ATTTCCCACA TTTATTCACC AACCACTAAA C AAT ATT AAT TTTATAAATA ATTCTGTACA 4 92 0 

AATCAGGGTA TATTGCCAGA AAGACTACCA TACAACATAA AGGATGGATA CAAATGACTT 4 98 0 

TACCTAAAAT TGGAAAGCCT GCAACACGCG CGCTAAATTC ASAGGTATA TACACATTAG 504 0 

AAGCAGTATC ACAATATACG AAGTCATCTC TAATGGAGAT GCATGGCGTT GGTCCTAAAG 510 0 

CTATATCAAT ATTGGAACAA GCTTTATTTC AG 5132
(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

AAGTAAAT T A TATTATGAAT TTGCCTGTCA ATTTCTTAAA GACATTCTTA CCGGAACTAA 60 
TTGAAAAAAA TGTCAAAGTT GAAACAATTG GATTTACTGA TAAGTTGCCA AAATCAACGA 120

TAGAAGCAAT TAATAATGCy mAAGAAAAGA CAGCTAATAA TACCGGC TTA AAATTAATAT 180 

TTGCAATTAA TTATGGTGGC AGAGCAGAAC TTGTTCATAG TATTAAAAAT ATGTTTGACG 240

AGCTTCATCA ACAAGGTTTA AATAGTGATA TCATAGATGA AACATATATA AACAATCATT 300 

T AAT G AC AAA AGACTATCCT GATCCAGAGT TGTTAATTCG TACTTCAGGA GAACAAAGAA 360 

TAAGTAATTT CTTGATTTGG CAAGTTTCGT ATAGTGAATT TATCTTTAAT CAAAAATTAT 4 20 

GGCCTGACTT TGACGAAGAT GAATTAATTA AAT G TAT AAA AATTATCAG TCACGTCAAA 4 80 

GACGCTTTGG CGGATTGAGT GAGGAGTAGT ATAGTATGAA AGTTAGAACG CTGACAGCTA 54 0 

TTATTGCCTT AATCGTATTC TTGCCTATCT TGTTAAAAGG CGGCCTTGTG TT AAT GAT AT 600 

TTGCTAATAT AT TAGCATTG ATTGCATTAA AAGAATTGTT G AAT AT G AAT ATGATTAAAT 6 60 

TTGTTTCAGT TCCTGGTTTA ATTAGTGCAG TTGGTCTTAT CAT CAT TAT G TTGCCACAAC 720 

ATGCAGGGCC ATGGGTACAA GTAATTCAAT TAAAAAGTTT AATTGCAATG AGCTTTATTG 780 

TATTAAGTTA TACTGTCTTA TCTAAAAACA GATTTAGTTT TATGGATGCT GCATTTTGCT 84 0 

TAATGTCTGT GGCTTATGTA GGCATTSGTT TTATGTTCTT TTATGAAACG AG AT C AG AAG 90 0 

GATTACATTA CATATTATAT GCCTTTTTAA TTGTTTGGCT TACAGATACA GGGGCTTACT 9 60 

TGTTTGGTAA AATGATGGGT AAACATAAGC TTTGGCCAGT AATAAGTCCG AATAAAACAA 1020 

TCGAAGGATT CATAGGTGGC TTGTTCTGTA GTTTGATAGT ACCACTTGCA AGTTATATT 1080 

TTGTAGATTT CAATATGAAT GTATGGATAT TACTTGGAGT GACATTGATT TTAAGTTTAT 114 0 

TTGGTCAATT AGGTGATTTA GTGGAATCAG GATTTAAGCG TCATTTCGGC GTTAAAGACT 12 00 

CAGGTCGAAT ACTACCTGGA CACGGTGGTA TTTTAGACCG ATTTGACAGC TTTATGTTTG 1260 

TGTTACCATT ATTAAATATT T TAT T AAT AC AATCTTAATG CTGAGAACAA ATCAATAAAC 1320 

GTAAAGAGGA GTTGCTGAGA TAATTTAATG AATCTCAGAA CTCCTTTTGA AAATTATACG 1380 

CAATATTAAC TTTGAAAATT ATACGCAATA TTAACTTTGA AAATTAGACG TTATATTTTG 14 4 0 

TGATTTGTCA GTATCATATT ATAATGACTT ATGTTACGTA TACAGCAATC ATTTTTAAAA 1500 

TAAAAGAAAT TTATAAACAA TCGAGGTGTA GCGAGTGAGC TATTTAGTTA CAATAATTGC 1560 

ATTTATTATT GTTTTTGGTG TACTAGTAAC TGTTCATGAA TATGGCCATA TGTTTTTTGC 162 0 

GAAAAGAGCA GGCATTATGT GTCCAGAATT TGCGATCGGT ATGGGGCCAA AAATTTTTA 168 0 

TTTTAGAAAA AATGAAACAC TTTACACTAT TAGGTTATTG CCTGTTGGTG GATATGTTCG 17 4 0 

TATGGCAGGA GATGGCTTAG AAG AG C C AC C AGTCGAGCCC GGTATGAACG TTAAAATTAA 1800 

AC T T AAT G AA GAAAATGAAA TAACACATAT CATATTAGAT GATCATCATA AGTTTCAACA 18 60 

AATTGAAGCG ATCGAA.GTTA AAAAATGTGA TTTTAAGGAT G AC T TAT T C A TAGAAGGTAT 192 0 
CACTGCTTAT GATAATGAAA GACATCATTT TAAAATTGCT AGAAAGTCTT TCTTTGTTGA 1980

AAATGGTAGC TTAGTTCAAA TTGCTCCGAG AGACAGACAA TTTGCACATA AAAAGCCATG 20 4 0 

GCCGAAATTT TTAACATTAT TTGCGGGACC GTTATTTAAC TTTATATTAG CTTTAGTCCT 2100 

ATTTATTGGT CTTGCATATT ATCaAGGcAC GCcTACGTCT ACTGTAGAAC AAGTCGCAGA 2160 

TAAGTATCCA GCTCAACAAG CAGGATTACA AAAAGGTGAT AAGATCGTCC AAATTGGCAA 222 0 

ATATAAAATA TCTGAATTTG ATGATGTTGA TAAGGCGTTA GATAAAGTTA AAGATAATAA 2280

GACGACTGTT AAATTTGAAC GTGATGGTAA AACAAAGTCA GTTGAATTAA CACCTAAAAA 234 0 

GACTGAAAAA AAACTGACTA AAGTAAGTTC AGAGACGAAG TATGTTCTCG GATTCCAACC 24 00 

AGCGAGTGAA CAT AC AC T T T TTAAACCAAT TGTATTCGGA TTTAAAAGCT TTTTAATCGG 24 60 

TAGTACTTAT ATTTTTACAG CTGTAGTAGG TATGTTGGCT AGTATATTTA CGGGCGGATT 2 520 

CTCATTTGAT ATGTTAAATG GTCCGGTTGG TATTTATCAT AACGTCGACT CAGTTGTTAA 258 0 

AGCGGGTATC ATTAGCTTAA TTGGTtnCAC TGCGTTATTA AGTGTAAACT TAGGTATTAT 2 64 0 

GAATTTAATT CCTATTCCTG CACTAGACGG TGGTCGTATT TTATTTGTA TATATGAAGC 27 00 

GATTTTCAGA AAACCAGTTA ATAAAAAAGC GGAAACAACG ATTATTGCTA TTGGTGCCAT 27 60 

TTTCATGGTC GTTATAATGA TATTAGTAAC GTGGAATGAT ATTCGACGAT ATTTCTTATA 2 82 0 

ATTTAGGAGG ATAAATAATT ATGAAGCAAT CCAAAGTTTT TATACCAACG ATGCGTGACG 288 0 

TGCCATCAGA AGCAGAAGCA C AAAG T CAT C GTTTATTATT GAAATCGGGT TTGATAAAAC 2 94 0 

AAAGTACAAG TGGGATTTAT AGTTATTTAC CGCTAGCAAC ACGTGTGTTA AATAATATTA 3000 

CTGCAATTGT GCGACAAGAA ATGGAACGTA TCGATTCTGT TGAAATTTTA ATGCCAGCGT 3060 

TACAACAAGC TGAATTATGG GAAGAATCAG GACGTTGGGG TGCATATGGC CCAGAATTAA 3120 

TGCGTTTACA AGATAGaCAT GGAAgACAAT TTgCATTAGG TCCaACACAT GAAGAAT TAG 318 0 

TTACATCAAT AGTAAGAAAT GAATTGAAAT CATACAAACA ATTACCGATG ACATTATTCC 324 0 

aAATTCAATC TAAATTCCGT GATGAAAAGA GACCACGTTT TGGTTTAyTC GTGGGGTGA 3300 

ATTTATTATG AAAG AT GC AT ATTCATTCCA TGCTGACGAG GCATCATTAG ATCAAACGTA 3360 

TCAAGATATG TATCAAGCGT ATAGCCGTAT TTTTGAGAGA GTTGGCATTA ACGCAAGACC 34 2 0 

AGTAGTTGCA GATTCAGGTG CTATAGGCGG TAGCCATaCA CATGAATTTA TGGCATTAAG 34 8 0 

TGCTATCGGT GAGGATACAA TCGTTTACAG TAAAGAAAGT GATTATGCTG CTAACATCGA 354 0 

AAAAGCAGAA GTCGTTTACG ArcCAaATcA TaAGCATACT ACTGTGCAAC CTTTAGAAAA 3600 

AATTGAAACA CCAAATGTTA AGACTGCGCA AGAATTGGCA GACTTCTTAG GTAGACCAGT 3660 

AGATGAAATC GTTAAAACGA TGATTTTCAA AGTTGAEGC GAATATATTA TGGTTTTAGT 37 2 0 
GCGTGGCCAT CATGAAATTA ATGACATTAA ATTAAAATCT TATTTCGGCA CAGATAATAT 3780

TGAATTAGCA ACACAAGACG AAATTGTTAA TTTAGTTGGT GCAAATCCTG GTTCACTAGG 38 4 0 

TCCTGTAATT GATAAAGAAA TCAAAATTTA TGCAGATAAT TTTGTGCAAG ATTTAAATAA 390 0 

TTTAGTTGTC GGTGCTAACG AAGATGGTTA TCACTTAATT AATGTAAATG TAGGTAGAGA 3960 

CTTCAACGTT GATGAATATG GCGATTTCCG TTTTATTTTA GAAGGC GAAA AGTTAAGTGA 4 020 

TGGTTCAGGC G T T GC AC AT T TTGCTGAAGG TATTGAAGTT GGTCAAGTAT TCAAATTGGG 4 08 0 

TACTAAGTAT TCAGAATC2A. TGAATGCTAC ATTCTTAGAT AACCAAGGAA AAGCTCAATC 4140 

TTTAATTATG GGTTGTTACG GAATTGGAAT TTCTAGAACG CTAAGTGCGA TTGTTGAACA 4 200 

AAAT C AC GAT GATAATGGAA TTGTTTGGCC TAAATCAGTT ACTCCGTTTG ATT T AC AT T T 4 2 60 

AATTTCTATT AATCCTAAGA AAGATGATCA AC G AG AAC T A GCASTGCAC TATATGCTGA 4 320 

ATTTAATACT AAATTTGATG TGTTGTACGA TGATCGTCAG GAACGTGCAG GTGTTAAATT 4 380 

TAATGATGCC GATTTAATTG GTTTACCACT GCGAATTGTT GTTGGTAAAC GTGCATCGGA 4 4 40 

AGGTATTGTA GAAGTTAAAG AACGTTTAAC AGGTGATAGC GAAGAAGTTC ACATTGATGA 4 500 

CTTAATGACT GTCATTACAA AT AAAT AT GA TAACTTAAAA TAATTAAGAT CGAATGAATT 4 5 60 

ATAAGAGTAG GAAAAAGCTG AAAGAAATCT GATGCTTATG TCCTGCTCTT ATTATTTTTG 4 620 

ATATAATGAT TATTCGATGA AAAAT G AC T G AAGACATAGT ATAATTAAAG ATAAATTTGT 4 680 

TTTAACAATA TAATGATTAG CC AAA TA.TAA AG CAT T T AAT TTTCTATCAT TACTATGCTC 4740 

ACATAATCTA AATATTGTTC GAACACGTAA AAGTAATTTC TATTTAAGGT GGTAATTGTC 4800 

TTGGCAATGA CAGAGCAACA AAAAT TT AAA GTGCTTGCTG ATCAAATTAA AATTTCAAAT 4 8 60 

C AAT TAG AT G CTGAAATTTT AAATTCAGGT GAACTGACAC GTATAGATGT TCTAACAAA 4 92 0 

AACAGAACAT GGGAATTTCA TATTACATTA CCACAATTCT TAGCTCATGA AGATTATTTA 4 980 

TTATTTATAA ATGCAATAGA GCAAGAGTTT AAAGATATCG CCAACGTTAC ATGTCGTTTT 504 0 

ACGGTAACAA ATGGCACGAA TCAAGATGAA CATGCAATTA AATACTTTGG GCACTGTATT 5100 

GACCAAACAG CTTTATCTCC AAAAGTTAAA GGTCAATTGA AACAGAAAAA GCTTATTATG 5160 

TCTGGAAAAG TATTAAAAGT AATGGTATCA AATGACATTG AACGTAATCA TTTTGATAAG 5220 

GCATGTAATG GAAGTCTTAT CAAAGCGTTT AGAAATTGTG GTTTTGATAT CGATAAAATC 52 80 

AT AT TCGAAA CAAAT G AT AA TGATCAAGAA CAftACTTAG CTTCTTTAGA AgCACaTATT 534 0 

CAAGAAGAAG ACGAACAAAG TGCACGATTG GCAACAGAGA AACTTGAAAA AATGAAAGCT 54 00 

GAAAAAGCGA AACAACAAGA T AAC AAC G AA AGTGCTGTCG ATAAGTGTCA AATTGGTAAG 54 60 

CCGATTCAAA T T G AAAAT AT TAAACCAATT GAATCTATTA TTGAGGAAGA GTTTAAAG3T 5520 

GCAATAGAGG GTGTCATTTT TGATATAAAC TTAAAAGAAC TTAAAAGTGG TCGCCATATC 5 580 
GTAGAAATTA AAGTGACTGA CTATACGGAC TCTTTAGTTT TAAAAATGTT TACTCGTAAA 5640

AACAAAGATG ATTTAGAACA TTTTAAAGCG CTAAGTGTTG GTAAATGGGT TAGGGCTCAA 57 0 0 

GGTCGTATTG AAGAAGATAC ATTTATTAGA GATTTAGTTA TGATGATGTC TGATATTGAA 57 60 

GAGATTAAAA AAGCGACAAA AAAAGATAAG GCTGAAGAAA AGCGTGTAGA ATTCCACTTG 5 82 0 

CATACTGCAA TGAGCCAAAT GGATGGTATA CCCAATATTG GTGCGTATGT TAAACAGGCA 58 8 0 

GCAGACTGGG GACATCCAGC CATTGCGGTT ACAGACCATAATGTTGTGCA AGCATTTCCA 594 0 

GATGCTCACG CAGCAGCGGA AAAACATGGC ATTAAAATGA TATACGGTAT GGAAGGTATG 6000 

TTAGTTGATG ATGGTGTTCC GATTGCATAC AAACCACAAG ATGTCGTATT AAAAGATGCT 6060 

ACTTATGTTG TGTTCGACGT TGAGACAACT GGTTTATCAA ATCAGTATGA TAAAATCATC 6120

GAGCTTGCAG CTGTGAAAGT TCATAACGGT GAAATCATCG ATAAGTTTGA AAGGTTTAGT 618 0 

AATCCGCATG AAC GAT TAT C GGAAACGATT ATCAATTTGA CGCATATTAC T GAT GAT AT G 624 0 

TTAGTAGATG CCCCTGAGAT TGAAGAAGTA CTTACAGAGT TTAAAGAATG GGTTGGCGAT 6300 

GCGATATTCG TAGCGCATAA TCCTTCGTTT GATATGGGCT TCATCGATAC GGGATATGAA 6360 

CGTCTTGGGT TTGGACCATC AACGAATGGT GTTATCGATA CTTTAGAATT ATCTCGTACG 6420 

ATTAATACTG AATATGGTAA ACATGGTTTG AATTTCTTGG CTAAAAAATA TGGCGTAGAA 64 80 

TTAACGCAAC ATCACCGTGC CATTTATGAT ACAGAAGCAA CAGCTTAET TTTCATAAAA 65 4 0 

ATGGTTCAAC AAATGAAAGA ATTAGGCGTA TTAAATCATA ACGAAATCAA CAAAAAACTC 6 600 

AGTAATGAAG ATGCATATAA ACGTGCAAGA CCTAGTCATG TCACATTAAT T G T AC AAAAC 6660 

CAACAAGGTC TTAAAAATCT ATTTAAAATT GTAAGTGCAT CATTGGTGAA GTATTTCTAC 6720 

CGTACACCTC GAATTCCACG TTCATTGTTA GATGAATATC GTGAGGGATT ATTGGTAGGT 6780 

ACAGCGTGTG ATGAAGGTGA ATTATTTACG GCAGTTATGC AGAAGGACCA GAGTCAAGTT 68 4 0 

GAAAAAATTG CCAAAT AT T A TGATTTTATT GAAATTCAAC CACCGGCACT TTATCAAGAT 6900 

TTAATTGATA GAGAGCTTAT TAGAGATACT GAAACATTAC ATGAAATTTA TCAACGTTTA 6960 

ATACATGCAG GTGACACAGC GGGTATACCT GTTATTGCGA CAGGAAATGC AC AC TAT T T G 7 020 

TTTGAACATG ATGGTATCGC ACGTAAAATT TTAATAGCAT CACAACCCGG CAATCCACTT 7 08 0 

AATCGCTCAA CTTTACCGGA AGCACATTTT AG AAC T AC AG ATGAAATGTT AAACSGTTT 714 0 

CATTTTTTAG GTGAAGAAAA AGCGCATGAA ATTGTTGTGA AAAATACAAA CGAATTAGCA 7200 

GATCGAATTG AACGTGTTGT TCCTATTAAA GATGAATTAT ACACACCGCG TATGGAAGGT 7 2 60 

GCTAACGAAG AAAT TAG AG A ACTAAGTTAT GCAAATGCGC GTAAACTGTA TGGTGAAGAC 7 320 

CTGCCTCAAA TCGTAATTGA TCGATTAGAA AAAGAATTAA AAAGT AT TAT CGGTAATGGA 7380 
TTTGCGGTAA TTTACTTAAT TTCGCAACGT TTAGTTAAAA AATCATTAGA TGATGGATAC 7440

TTAGTTGGTT CCCGTGGTTC AGTAGGTTCT AGTTTTGTAG CGACAATGAC T GAG AT TACT 7500 

GAAGTAAACC CGTTACCGCC ACACTATATT TGTCCGACT G T AAAAC GAG TGAATTTTTC 7 5 60 

AATGATGGTT CAGTAGGATC AGGATTTGAT TTACCTGATA AGACGTGTGA AACTTGTGGA 7 62 0 

GCGCCACTTA TTAAAGAAGG ACAAGATATT CCGTTTGAAA CATTTTTAGG ATTTAAGGGA 7 68 0 

GATAAAGTTC CTGATATCGA CTTAAACTTT AGTGGTGAAT ATCAACCGAA TGCCCATAAC 7 74 0 

TACACAAAAG TATTATTTGG T G AGG AT AAA GTATTCCGTG CAGGTACAAT TGGTACTGTT 7 800 

GCTGAAAAGA CTGCTTTTGG TTATGTTAAA GGTTATTTGA ATGATCAAGG TATCCACAAA 7 8 60 

AGAGGTGCTG AAATAGATCG ACTCGTTAAA GGATGTACAG GTGTTAAACG TACAACTGGA 7 92 0 

CAGCATCCAG GGGGTATTAT TGTAGTACCT GATTACATGG ATATTTATGA TTTTACGCCG 7 980 

AT AC AAT AT C CTGCCGATGA TCAAAATTCA GCATGGATGA CGACACATTT TGATTTCCAT 8 04 0 

TCTATTCATG ATAATGTATT AAAAC T T G AT ATACTTGGAC ACGATGATCC AACAATGATT 8100 

CGTATGCTTC AAGATTTATC AGGAATTGAT CCAAAAACAA TACCGTAGA TGATAAAGAA 8160 

GTTATGCAGA TATTTAGTAC ACCTGAAAGT TTGGGTGTTA CTGAAGATGA AATTTTATGT 8220 

AAAACAGGTA CATTTGGGGT AC C AG AAT T C GGTACAGGAT TCGTGCGTCA AATGTTAGAA 8280 

GATACAAAGC CAACAACATT TTCTGAATTA GTTCAAATCT CAGGATTATC TCATGGTACA 834 0 

GATGTGTGGT TAGGCAATGC TCAAGAATTA AT T AAAAC C G GTATATGTGA TTTATCAAGT 8400 

GTAATTGGTT GTCGTGATGA TATCATGGTT TATTTAATGT ATGCTGGTTT AG AAC CAT C A 84 60 

ATGGCTTTTA AAATAATGGA GTCAGTACGT AAAGGTAAAG GTTTAACTGA AGAAATGATT 8 52 0 

GAAACGATGA AAGAAAATGA AGTGCGAGAT TGGTATTTAG ATTCATGTCT TAAAATTAAG 8 58 0 

TACATGTTCC CTAAAGCCCA TGCAGCAGCA TACGTTTTAA TGGCAGTACG TATCGCATAT 8 64 0 

T TCAAAGT AC ATCATCCACT TTATTACTAT GCATCTTACT TTACAATTCG TGCGTCAGAC 87 00 

TTTGATTTAA TCACGATGAT TAAAGATAAA ACAAGCATTC GAAATACTGT AAAGACATG 8 7 60 

TATTCTCGCT ATATGGATCT AGGTAAAAAA GAAAAAGACG TATTAACAGT CTTGGAAATT 8 82 0 

ATGAATGAAA TGGCGCATCG AGGTTATCGA AT GCAACCGA TTAGTTTAGA AAAG AG T C AG 8 880 

GCGTTCGAAT TTATCATTGA AGGCGATACA CTTATTCCGC CGTTCATATC AGTGCCTGGG 8 94 0 

CTTGGCGAAA ACGTTGCGAA ACGAATTGTT GAAGCTCGTG ACGATGGCCC ATTTTTATCA 90 00 

AAAGAAGATT TAAACAAAAA AGCTGGATTA TCTCAGAAAA TTATTGAGTA TTTAGATGAG 90 60 

TTAGGCTCAT TACCGAATTT AC C AG AT AAA GCTCAACTTT CGATATTTGA TATGTAAAAT 9120 

GAAATAATCA AGGTATTTAT TTAATGCGTA TGtEGTAGTC AAAGAAATAC AAAATTGTTG 9180 

CTGGACACAA AATTATGCCC GTATTTCTTT TCAATGTCTT ACGAGTCTAT TCAAATGTAA 92 4 0 
TGGTGAAATA AAGGAACAAA CTTTTACAAG AATCTCTGAT TAATAGTGAA GTCATTTGTT 9300

TCAAGCATAA ACTTATGCTA TAATTAAGTT GCTTAAAAAT TAGTGAACTC AGGCAGAAS 9 3 60 

GTGGGAGATT CCCGCTCTTT TCTATTTGCC AAAAAGGGAG GCCTGTATGA GTAAAATTAC 94 20 

AGAACAAGTA GAAGTGATTG TTAAACCAAT TAT GGAAGAC TTGAATTTTG AACTTGTAGA 94 80 

CGTTGAATAT GTCAAAGAGG GTAGAGATCA TTTTCTTAGA ATCTCTATTG ATAAAGAAGG 95 4 0 

TGGCGTAGAT TTAAATGATT GTACGCTAGC TTCTGAAAAA ATAAGTGAAG CTATGGATGC 9600 

AAATGATCCT ATTCCTGAAA TGTATTATTT AGACGTAGCG TCACCTGGTG CAGAACGTCC 9660 

AATTAAAAAA GAACAAGATT TCCAAAATGC AATAACTAAA CCTGTATTTG TTTCTTTATA 9720 

TGTACCAATT GAAGGTGAAA AGGAATGGTT AGGC AT T T T A C AAGAAG T C A ATAATGAAAC 97 80 

AATTGTAGTA CAAGTTAAAA TCAAAGCAAG AACGAAAGAT ATAGAGATAC C GAG AG AC AA 98 4 0 

AATAGCAAAA GCACGTCACG CAGTTATGAT TTAACGTGAT GAGGAGGAAA AAACGTGTCA 9900 

AGTAATGAAT TATTATTAGC TACTGAGTAT TTAGAAAAAG AAAAGAAGAT TCCTAGAGCA 9960

GTATTAATTG ATGCTATTGA AGCAGCTTTA ATTACTGCAT ACAAAAAGAA TTATGATAGT 10020 

GCAAGAAATG TCCGTGTGGA ATTAAATATG GAT C AAGGT A CTTTCAAAGT TATCGCTCGT 10 080 

AAAGATGTTG TTGAAGAAGT ATTTGACGAC AG AG AT G AAG TGGATTTAAG TACAGCGCTT 1014 0 

GTTAAAAACC CTGCATATGA AATTGGTGAT ATATACGAAG AAG AT G T AAC ACCTAAAGAT 10200 

TTTGGTCGTG TAGGTGCTCA AGCAGCGAAA CAAGCAGTAA TGCAACGTCT TCGTGATGCT 102 60 

G AAC G T G AAA TTTTATTTGA AGAATTTATA GACAAAGAAG AAGACAT AC T TACTGGAATT 10320 

ATTGACCGTG TTGACCATCG TTATGTATAT GTGAATTTAG GTCGTATGA AGCTGTTTTA 10380 

TCTGAAGCAG AAAGAAGTCC TAACGAAAAA TATATTCCTA ACGAACGTAT CAAAGTATAT 10440 

GTTAACAAAG TGGAACAAAC GACAAAAGGT CCTCAAATCT ATGTTTCTCG TAGCCATCCA 105 00 

GGTTTATTAA AACGTTTATT T G AAC AAG AA GTTCCAGAAA TTTACGATGG TACTGTAATT 105 60 

GTTAAATCAG TAGCACGTGA AGCTGGCGAT CGCTCTAAAA TTAGTGTCTT CTCTGAAAAC 10620 

AATGATATAG ATGCTGTTGG TGCATGTGTT GGTGCTAAAG GCGCACGTGT TGAAGCTGTT 10680 

GTTGAAGAGC TAGGTGGTGA AAAAATCGAC ATCGTTCAAT GGAATGAAGA TCCAAAAGTA 10740 

TTTGTAAAAA ATGCTTTAAG CCCTTCTCAA GTTTTAGAAG TTATTGTTGA TGAAACAAAT 10800 

CAATCTACAG TAGTTGTTGT TCCTGATTAT CAATTGTCAT TAGCGATTGG TAAAAGAGGA 108 60 

CAAAAC GC AC GTCTAGCTGC TAAATTAACC GGCTGGAAAA TTGATATTAA ATCAGAAACA 10 920 

GATGCGCGTG AAGCGGGTAT CTATCCAGTA GTTGAAGCTG AAAAAGTAAC TGAASAGAT 10 980 

GTTGCTTTAG AAGATGCTGA CACAACAGAA T C AAC C G AAG AGGTAAATGA TGTTTCAGTT 1104 0 
GAAACAAATG TAGAGAAAGA ATCTGAATAA TAGGTTGGAG TGAAGTATCT ATGAAAAAGA 11100

AAAAAATTCC GATGCGAAAA TGTATTCTTT CAAATGAAAT GCATCCCAAA AAAGAT AT GA 11160 

TTCGTGTTGT TGTTAATAAA GAAGGCGAAA TCTTTGCGGA TGTTACTGGA AAGAAACAAG 11220 

GCCGTGGCGC ATATGTTTCT AAAGATGTTG CTATGGTTGA AAAAG C AC AA CAAAAAGAAA 112 8 0 

TTTTAGAAAA AT AT T T T AAA GCATCTAAAG AGCAATTGGA TCCTGTTTAC AAAGAAATTA 11340 

TTAGATTAAT TTATAGAGAA GAGATCCCAA AATGAGATA GATCAAATAT TAAACTTTTT 114 00 

AGGATTAGCA ATGAGAGCTG GTAAAGTAAA AACAGGTGAA TCAGTCATTG TTAATGAGAT 114 60 

TAAAAAAGGA AATTTGAAGC TCGTTATTGT TGCAAATGAT GCGTCTGATA ATACAGCTAA 11520 

ATTAATTACA GATAAATGTA AGAGTTACAA AGTTCCATTC AGAAAGTTTG G AAAT C G AAA 1158 0 

TGAATTGGGA ATAGCACTTG G AAAAG G T G A GCGTGTTAAT GTAGGGATTA CTGACCCAGG 11640 

CTTTGCTAAA AAGTTGCTAT CAATGATAGA TGAATATCAT AAGGAGTGAT TATATGAGTA 117 00 

AACAAAGAAT T T ACGAAT AT GCGAAAGAAT TAAATCTAAA GAGTAAAGAG AT T AT AGAT G 117 60 

AGTTAAAAAG CATGAATATT GAGGTTTCAA ATCATATGCA AGCTTTGGAA GATGACCAAA 118 20 

TTAAAGCATT AGATAAAAAG TTCAAAAAAG AACAAAAGAA CGACAATAAA CAAAGCACTC 118 80 

AAAATAATCA CCAAAAATCA AAC AAT C AAA ACCAAAATAA AGGGCmACAA AAAGAT AACA 11940 

AAAAG AAT Cm ACAACAAAAT AATAAAGGCA ACAAAGGCAA TAAAAGAAT AAT AGAAAT a 12 00 0 

ATAAGAAAAA TAACAAGAAT AATAAACCAC AAAATCAACC AGCTGCTCCA AAAGAAATAC 12060 

CAT C AAAAG T G AC AT AT C AA GAAGGTATTA CAGTAGGCGA ATTTGCGGAT AAATTAAATG 12120 

TTGAATCATC AGAAATTATC AAAAAATTAT TCTTACTTGG TATTGTTGCT AATATCAATC 12180 

AATCATTAAA TCAAGAAACA ATCGAATTAA TTGCCGATGA TTATGGCGTT GAGGTTGAAG 1224 0 

AAGAAGTTGT GATTAATGAA GAAGACTTAT CAATCTATTT CGAAGACGAA AAAGATGATC 12300 

CAGAGGCAAT T GAG AG AC C A GCAGTTGTAA CAATTATGGG ACATGTTGAC CATGGTAAAA 123 60 

CGACTTTATT AGATTCAATT CGTCATACAA AAGTTACAGC AGGTGAAGCA GGCGGAATCA 12420 

CTCAACATAT TGGTGCATAT C AAAT T G AAA ACGATGGCAA AAAAATCACT TTCTTAGATA 124 80 

CACCGGGACA TGCTGCATTT ACAACGATGC GTGCGCGTGG TGCaCAAGTA ACAGATATTA 12540 

CTATTTTAGT AGTAGCAGCT GACGATGGTG TTATGCCACA AAC AAT T G AA GAATTAACC 12600 

ATGCTAAAGA AGCAgAAGTA CCAATTATTG TTGCAGTAAA TAAAATTGAT AAACCAACTT 12660 

CAAATCCTGA TCGAGTTATG CAAGAATTAA CTGAATATGG TTTAATTCCT GAAGATTGGG 127 20 

GCGGCGAAAC AATTtTCGTc CACTTTCTGC ATTAAGTGGT GATGGTATCG AC GAT T TAT T 12780 

AGAAATGATA GGATTAGTTG CAGAAGTTCA AGAAC T T AAA GCAAATCCTA AAAACCGTGC 128 4 0 

TGTTGGTACA GTTATCGAAG CTGAATTAGA TAAATCACGT GGTCCTTCTG CATCATTATT 12 900 
AGTACAAAAC GGTACATTAA ATGTTGGTGA TGCGATTGTA GTTGGTAATA CTTACGGCCG 12960

TATCCGTGCA ATGGTTAATG ACTTAGGTCA AAtAATCAAA ACGGCTGGTC CATCAACGCC 13020 

TGTTGAAATT ACAGGTATTA ATGATGTGCC ACAAGCTGGG GATCGCTTTG TTGTATTTAG 13080 

TGATGAAAAA CAAGCTCGTC GTATTGGTGA ATCAAGACAC GAAGCTAGCA TTATACAACA 1314 0 

ACGTCAAGAA AGTAAAAATG TTTCATTAGA TAACCTGTTT GAACAAATGA AACAAGGTK 13200 

AATGAAAGAT TTAAACGTTA TTATTAAAGG TGATGTTCAA GGTTCTGTTG AAGCTTTAGC 132 60 

TGCATCATTA ATGAAAATTG ATGTTGAAGG CGTAAATGTT CGTATCATTC ATACAGCGGT 13320 

TGGTGCAATT AATGAGTCAG AC G T G AC AC T TGCTAATGCC TCAAATGGTA TTATCATTGG 1338 0 

TTTCAATGTT CGTCCAGACA GTGGTGCAAA ACGTGCTGCA GAAGCTGAAA AT G T T GAT AT 13440 

GCGTTTACAC AGAGTTATTT ATAATGTTAT CGAAGAAATT GAATCAGCGA TGAAAGGTTT 13500 

ACTTGATCCA GAATTTGAAG AACAAGTTAT CGGACAAGCT GAAGTTCGTC AAACATTCAA 135 60 

AGTTTCTAAA GTTGGTACTA TTGCTGGATG TTATGTTACT GAAGGTAAAA TTACGCGAAA 13 620 

TGCTGGTGTA CGTATTATTC GTGATGGTAT TGTTCAATAT GAAGGCGAAT TAGATACACT 13 680 

TAAACGTTTC AAAGATGATG CTAAGGAAGT TGCAAAAGGT TATGAATGTG GTATTACAAy 137 4 0 

TGAAAACTAC AATGACCTTA AAGAAGGCGA TGTTATCGAA GCATTTGAAA TGGTTGAAAT 13800

TAAGCGTTAA TTAAATAAAT TACAAGCTAA AAGTATAGTT AAG AT T GAT A TGCTCCCTAT 138 60 

AAATATTGCA CTTTTTAAGT GTCTACTTTA TAGGGAGCAT ATTTGATACT AGCTTTTGGT 13920 

TTTTTATTAG AATAGATTAC CTATTAAAAG T T AC G T TATA TGGACATGAT TTTGTATAAA 13 980 

ATTTTGTGGT GGCCTAGAAT GATTTTTAAT GACAAAATAT AATGTCGACT ATTATTGGAA 14040 

AATTTTCTGT TGaAATGCCT ATCTTACGGC AAACTTTATT TGATTTTATA GGCTTAATTT 14100 

ATTAAAATAA CGTGTGAGCT AAAATAATTG TTTAAGCATT GTTACACTAA AAAATGCAAA 14160 

TAACAATTGA ACTTAAAGAT AAAGAGGTGA CAAGAATGAG CAGTATGBA GCAGAGCGTG 14 220 

TTGGTGAACA AATGAAGAAG GAATTAATGG AT AT CAT C AA CAATAAAGTC AAAGATCCTC 14 280 

GAGTTGGTTT TATTACAATT ACAGATGTTG TTTTAACAAA TGATTTATCG CAGGCTAAAG 14 34 0 

TATTTTTAAC TGTATTAGGT AACGATAAAG AAGTAGAAAA T AC AT T T AAA GCACTTGATA 14 4 00 

AAGCAAAAGG C T T CAT T AAG TCTGAATTAG GTTCTAGAAT GCGATTACGT ATTATGCCGG 14 4 60 

AATTAATGTA TGAATATGAT CAATCAATCG AATATGGTAA TAAAATTGAA CGAATGATTC 14 520 

AAG AT T T AC A CAAACAAGAT AGATAATTTA GTGTTAGGTA TCTGGAAAAT GTTTGATAAT 14 580 

TTCTTAATAT CGGTATATTA ACATTAAACA GTTAATACAT AG AT G T G TAG AAATAGTTAA 14 640 

CATTTTCCAG TTTTTTTATG AATAAATTTA GTTGATACGC TATTAAAATA TATTTTAAAA 14700 
AAGAAGGTGA CTATATGTAT AATGGGATAT TACCAGTATA TAAAGAGCGC GGTTTAACAA 14760

GTCATGACGT TGTATTCAAA TTGCGTAAAA TATTAAAAAC TAAAAAAATA GGTQSACGG 14 820 

GTACGCTTGA TCCCGAAGTT GCAGGCGTGT TACCGGTATG TATAGGTAAT GCAACGAGAG 14 8 80 

T TAG T GAT T A TGTTATGGAT ATGGGCAAAG CTTATGAAGC AACTGTATCG ATAGGAAGAA 14 940 

GTACAACGAC TGAAGATCAA ACGGGTGATA CATTGGAAAC AAAAGGTGTA CACTCAGCAG 15000 

ATTTTAATAA GGACGATATT GACCGATTGT TAGAAAGTTT TAAAGGTATC ATT G AAC AAA 150 60 

TTCCGCCGAT GTACTCATCC GTCAAAGTAA ATGGTAAAAA ATTATATGAA TATGCGCGTA 15120 

ATAATGAAAC AGTTGAAAGA CCAAAGCGTA AAGTtAATAT TAAAGACATT GGGCGTATAT 1518 0 

CTGAATTAGA TTTTAAAGAA AATGAGTGTC ATTTTAAAT ACGCGTCATC TGTGGTAAAG 15240 

GT AC AT AT AT TAGAACGCTA GCAACTGATA TTGGTGTGAA ATTAGGCTTT CCGGCACATA 1530 0 

TGTCGAAATT AACACGAATC GAGTCTGGTG GATTTGTGTT GAAAGATAGC CTTACATTAG 15360 

AACAAATAAA AGAACTTCAT GAGCAGGATT CATTGCAAAA TAAATTGTTT CCTTTAGAAT 154 2 0 

AT GG AT T AAA GGGTTTGCCA AGCATTAAAA TTAAAGATTC GCACATAAAA AAACGTATTT 154 8 0 

TAAATGGGCA GAAATTTAAT AAAAATGAAT TTGATAACAA AATTAAAGAC CAAATTGTAT 15540 

TTATTGATGA TGATTCAGAA AAAGTATTAG CAATTTATAT GGTACACCCT ACAAAAGAAT 15 600 

CAGAAATTAA ACCTAAAAAA GTCTTTAATT AAAGGAGATA GAATTTATGA AAGTCATAGA 15 660 

AGtGACACAT CCTATACAAT C T AAACAGT A TATTACAGAG GATGTTGCAA TGGCATTCGG 15720 

ATTTTTCGAT GGCATGCATA AAGGTCATGA CAAAGTCTTT GAT AT AT T AA ACGAAATAGC 157 8 0 

TGAGGCACGC AGTTTAAAAA AAGCGGTGAT GACATTTGAT CCGfiTCCGT CTGTCGTGTT 158 4 0 

GAATCCTAAA AGAAAACGAA CAACGTATTT AACGCCACTT TCAGATAAAA TCGAAAAAAT 15 900 

TAGCCAACAT GATATTGATT ATTGTATAGT GGTTAATTTT TCATCTAGGT TTGCTAATGT 15960 

GAGCGTAGAA GATTTTGTTG AAAAT TAT AT AATTAAAAAT AATGTAAAAG AAGTCATTGC 16020 

TGGTTTTGAT TTTACTTTTG GTAAATTTGG AAAAGGTAAT ATGACTGTAC TTCAAGAATA 16080 

TGATGCGTTT AAT AC G AC AA TTGTGAGTAA ACAAGAAATT G AAAAT G AAA AAATTTCTAC 16140 

AACTTCTATT CGTCAAGATT TAATCAATGG TGAGTTGCAA AAAGCGAATG ATGCTTTAGG 16200 

CTATATATAT TCTATTAAAG kCACTGTAGT GCAAGGTGAA AAAAGGGGAA GAACTATTGG 162 60 

CTTCCCAACA GCTAACATTC AACCTAGTGA TGATTATTTG TTACCTCGTA AAGGTGTTTA 1632 0 

TGCTGTTAGT ATTGAAATCG GCACTGAAAA T AAAT TAT AT CGAGGGGTAG CTAACATAGG 1638 0 

TGTAAAGCCA ACATTTCATG AT C C T AAC AA AGCAGAAGTT GTCATCGAAG GAATATCTT 16440 

TGACTTTGAG GATAATATTT ATGGTGAACG AGTGACCGTG AATTGGCATC ATTTCTTACG 16500 

TCCTGAGATT AAATTTGATG GTATCGACCC AT TAG T T AAA C AAAT G AAC G ATGATAAATC 165 60 
GCGTGCTAAA TATTTATTAG CAGTTGATTT TGGTGATGAA GTAGCTTATA ATATCTAGAG 16620

TTGCGTATAG t T AT AT AAAC AATCTATACC ACACCTTTTT CTTAGTAGGT CGAATCTCCA 16680 

ACGCCTAACT CGGATTAAGG AGTATTCAAA CATTTTAAGG AGGAAATTGA TTATGGCAAT 16740 

TTCACAAGAA CGTAAAAACG AAATCATTAA AGAATACCGT G T AC AC G AAA CTGATACTGG 16800 

TTCACCAGAA GTACAAATCG CTGTACTTAC TGCAGAAATC AACGCaGTAA ACGAACACTT 168 60 

ACGTACACAC AAAAAAGACC ACCATTCACG TCGTGGATTA TTAAAAATGG TAGGTCGTCG 16920 

TAGcATTTaT TAAACTACTT ACGTaGTAAA GATATTCAAC GTTACCGTGA ATTAATTAAA 16980 

TCACTTGGTA TCCGTCGTTA ATCTTAATAT AACGTCTTTG AGGTTGGGGC ATATTTATE 17040 

TCCAACCTTA ATTTATATTA AAAAAGCTTT TTACAAATAT TAACATTTAT TATATGTTAA 17100 

GCTAATATTG AGTGAATAAT AAGGTTACAA T GAG AT AAA G ATGATATAAG TACACCTAGA 17160 

GTAATAATCA AGATATTAAA AATAAAGTAT GTTTTTTTAA AAAATATAAC TTATATTTAT 17220 

ACTGATAAGG GTGGGAC GAT AAGTCTATTT TGTAAATAAT AGATGGATAT CCCGCTCTCT 172 8 0 

TTTTTTCCAA TTCAATATTT TATAACTAAT ATTAAAATAC GATAATAAAT GATATGATAT 17 34 0 

AACTATTAGA TTCAAGAGAG GAGATTTATA ATGTCTCAAG AAAAGAAAGT TTTTAAAACT 174 00 

GAATGGGCAG GAAGATCTTT AACGATTGAA ACAGGGCAAT TAGCTAAACA AGCAAATGGC 174 60 

GCTGTATTGG TTCGTTATGG AGATACAGTC GTGTTATCGA CGGCAACTGC ATCAAAAGAA 17520 

CCTCGTGATG GAGATTTCTT CCCATTAACA GTGAACTATG AAGAAAAAAT GTACGCTGCG 17 58 0 

GGTAAAATTC CTGGTGGATT TAAAAAGAGA GAAGGACGTC CTGGTGACGA TGCAAC AT TA 17640

ACTGCGCGAT TAATTGATAG ACCAATTAGA CCTTTATTCC CTAAAGGATA TAAGCATGAT 17700 

GTTCAAATTA TGAACATGGT ATTAAGTGCA GATCCTGATT GTTCACCACA AATGGCTGCA 177 60 

ATGATTGGTT CATCTATGGC GCTTAGTGTG TCGGATATTC CATTCCAAGG GCCAATCGCC 17 820 

GGTGTAAATG TGGGTTATAT TCTiCGGTAAA TATATCATTA ACCCAACAGT AGAAGAAAAA 178 80 

GAAGTTTCTC GTTTAGACCT TGAAGTAGCT GGTCATAAAG ATGCGGTAAA CATGGTAGAG 17 94 0 

GCAGGCGCTA GTGAGATTAC TGAACAAGAA ATGTTAGAGG CGATTTTCTT TGGTCATGAA 18 000 

GAGATTCAAC GTTTAGTTGA TTTCCAACAA CAAATCGTCG ACCACATCEA ACCTGTTAAA 18060 

CAAGAATTTA TTCCAGCAGA GCGTGaTGAA GCGCTAGTTG AACGTGTAAA ATCTTTAACC 18120 

GAAGAAAAAG GACTTAAAGA AACAGTTTTA ACATTTGATA AACAACAACG AGATGaAAAT 18180 

CTTGATAACT TAAAAGAAGA AATCGTCAAT GAATTTATCG ATGAAGAAGA TCCAGAGAAT 182 4 0 

GAaTTACTTA TTAAAGAAGT TTATGCAATT TTAAATGAAT TAG T G AAAG A AGAAGTTCGA 18300 

CGTTTAATTG C AG AT G AAAA AAT TAG AC C A GACGGCCGTA AACCTGATGA AATCCGTCCA 18360 
TTAGATTCTG AAGTTGGTAT TTTACCTAGA ACGCATGGTT CAGGTCTATT TACACGTGGT 18420

CAGACTCAAG CACTTTCAGT TTTAACATTA GGTGCTTTAG GCGATTATCA ATTAATTGAT 18 4 80 

GGTTTAGGAC CTGAAGAAGA AAAAAGATTC ATGCATCATT ACAACTTCCC GAATTTTTCA 18 54 0 

GTAGGTGAAA CTGGTCCAGT ACGTGCGCCA GGTCGTCGTG AAATTGGACA TGGTGCGTTA 18 600 

GGTGAAAGAG CATTAAAATA TATTATTCCT GATACTGCTG ATTTCCCATA TACAATCGT 18 660 

ATTGTAAGTG AGGTACTTGA ATCAAATGGT TCATCATCTC AAGCGTCAAT TTGTGGATCA 18720 

ACATTAGCAT TAATGGATGC GGGCGTACCG ATTAAAGCAC CAGTTGCTGG TATTGCTATG 1878 0 

GGCCTTGTTA CACGTGAAGA TAGCTATACG ATTTTAACTG ATATCCAAGG TATGGAAGAT 1884 0 

GCATTAGGTG ATATGGACTT TAAAGTCGCT GGTACTAAAG AAGGTATTAC AGCAATCCAA 18 900 

ATGGATATTA AAATTGACGG TTTAACGCGT GAAATTATCG AAGAGGCTCT AGAACAAGCG 18 960 

AGACGTGGTC GTTTAGAAAT AATGAATCAT AT G T T AC AAA CAATTGATCA ACCACGTACT 1902 0 

GAATTAAGTG cTTACGCGCC AAAAGTTGTA AC TAT GKAA TTAAACCAGA T AAG AT TAG A 1908 0 

GATGTTATCG GACCTGGTGG TAAAAAAATT AACGAAATTA TTGATGAAAC AGGTGTTAAA 1914 0 

TTAGATATTG AACAAGATGG TACTATCTTT ATTGGTGCTG TTGATCAAGC TATGATAAAT 19200 

CGTGCTCGTG AAAT CAT T G A GGAAATTACA CGTGAAGCGG AAGTAGGTCA AACTTATCAA 192 60 

GCCACTGTTA AACGTATTGA AAAATACGGT GCGTTTGTAG GCCTATTCCC AGGTAAAGAT 1932 0 

GCGTTGCTTC ACATTTCACA AATTTCAAAA AATAGAATTG AAAAAGTGGA AGATGTATTA 1938 0 

AAAATCGGTG ACACAATTGA AGTTAAGATT ACTGAAATTG ATAAACAAGG TCGAGTAAAT 19440 

GCTTCACATA GAGCATTAGA. AGAATAATAT TTAAAGTCAT AT G AC G AC AA TGTATCGTCA 19500 

TGTGATTTTT TTATGCCACT TTTTACGAAG TGACCCGTTT TGAATTTGTT GTATTGAACA 195 60 

TTTTAAAACG CTTTATTATT TTGTGTGCAA CTGTTAATTA TCCTGTATGT ATAGTGATTA 19620 

ATAGTGTACA TCAAGTGTTT TTTAACTTAT AATGAATAGT GAGTTATAT ATGGACGGGT 19680 

AACAAATTTA GGAGGTAAGA TTTTGAGTTT AATAAAGAAA AAGAATAAAG ATATTCGCAT 19740 

TATACCATTA GGCGGTGTTG GCGAAAT TGC TAAAAATATG TATATCGTTG AAG TAG AC GA 19800 

TGAAATGTTT ATGTTAGATG CTGGACTTAT GTTTCCAGAA GACGAAATGC TAGGTATTGA 198 60 

TATTGTTATA CCAGACATTT CATACGTACT TGAAAATAAA GATAAATTGA AGGGTATATT 19920

CCTTACACAC GGACATGAGC ACGCGATTGG TGCAGTGAGT TATGTTTTAG AACAATTAGA 1998 0 

TGCACCAGTA TATGGATCTA AATTGACAAT AGCGTTAATT AAAGAAAATA TGAAAGCCCG 2004 0 

TAATATTGAT AAAAAAGTTC GCTACTATAC AGTTAATAAT GATTCAATTA TGAGATTCAA 20100 

AAACGTGAAT ATTAGTTTCT TTAATACGAC ACACAGTATT CCTGATAGTT TAGGTGTTTG 2 0160 

TATTCACACT TCATATGGTG CCATTGTGTA TACAGGTGAA TTTAAGTTTG ACCAAAGTTT 20220 






AC AT GG AC AT TATGCACCAG ATATTAAACG TATGGCAGAG ATTGGTGAAG AGGCGTATT 20280 

TGTCTTAATC AGTGATTCTA CTGAGGCAGA GAAACCTGGA TATAATACTC CGGAAAATGT 2 034 0 

GATTGAACAT CATATGTATG ATGCTTTTGC AAAAGTGCGA GGTCGCTTGA TAGTTTCATG 20400 

TTATGCTTCG AACTTTATAC GTATTCAGCA AGTTTTAAAT ATTGCTAGCA AGCTAAATCG 204 60 

TAAAGTGTCA TTTTTAGGAA GAT C AC T T G A AAGTTCATTT AATATTGCTC GTAAAATGGG 20520 

GTATTTCGAC ATTCCTAAAG ATTTGCTAAT TCCTATAACA GAAGTTGATA ATTATCCTAA 20580 

AAATGAAGTG ATAATTATAG CTACTGGTAT GCAAGGAGAA CCTGTAGAAG CCTTAAGTCA 2064 0 

AATGGCGCAA CATAAGCATA AAATTATGAA TATJGAAGAA GGCGATTCTG TATTTTTAGC 20700 

AATTACGGCT TCTGCTAATA TGGAAGTTAT CATTGCGAAT AcATTAAATG AGCtTgTtAC 207 60 

GnCTGGCGCA CATATTATTC CAAATAACAA AAAGAT T CAT GCTTCAAGTC ATGGTTGCAT 2 0820 

GGAAGAATTA AAAATGATGA TTAATATTAT GAAACCTGAA TACTTTATTC CTGTACAAG 208 8 0 

TGAATTTAAA ATGCAGATAG CACATGCGAA GCTAGCAGCT GAAGCAGGTG TTGCACCAGA 20940 

AAAGATTTTC CTTGTGGAAA AAGGAGATGT CATTAATTAC AACGGTAAAG ATATGATATT 21000 

AAATGAAAAG GTAAATTCAG GAAATATTTT AATAGATGGC ATTGGTATTG GGGATGTAGG 21060 

AAATATCGTG TTGAGAGACC GTCATCTTTT AGCAGAAGAT GGTATCTTTA TTGCTGTTGT 2112 0 

AACGTTAGAT CCTAAAAATA GACGTATAGC TGCGGGACCT GAAATTCAAT CTCGTGGGTT 21180 

TGTATATGTA CGTGAAAGTG AAGACTTATT ACGTGAAGCA GAAGAGAAAG TACGTGAAAT 21240 

AGTAGAGGCT GGTTTACAAG AAAAACGCAT AGAATGGTCT GAAATTAAAC AAAATATGCG 21300 

TGATCAAATT AGTAAACTAT TATTCGAAAG TACAAAACGT CGTCCTATGA TTATTCCAGT 21360 

AATTTCTGAA ATTTAATCAA AAAGTCATTA ACATAAAAGA GGTCAGAACA AGTCACTGAA 214 20 

ATATAATGGT TGTCATGGAC AATTTACTTA TATTTTATGA TAGTCAATTG AAGGGGTAAC 21480

GATTAATCTG TTATCTTAAG TAAATTGATA CATAGATGAT ATTGTTCTAA CCTCTTTCAT 21540 

CGTCTGTTTG GACTACATAT TCTAAACATC AAATAGGAAA TTATATATAA TAACGTCGTT 21600 

TTAACTAAGG CAACATAAGG AGGTGCGTCA ATTGGCACAA GCAAAAAAGA AATCGACAGC 21660 

TAAGAAAAAA AC AAC AT C AA A^AAAGAAC AAATTCGAGG AAAAAGAAGA ATGATAATCC 21720 

GATACGTTAT GTCATAGCTA TTTTAGTAGT TGTATTAATG GTGTTGGGTG TTTTCCAATT 21780 

AGGAATAATA GGTCGTCTAA TTGACAGCTT CTTTAATTAT TTATTTGGGT ACAGTAGATA 218 4 0 

TTTAACATAT ATTTTAGTAC TCTTAGCAAC TGGTTTTATT ACATACTCA AACGTATTCC 21900 

TAmaACTAGA CGAACGGCTG GTTCGATTGT ATTGCAAATT GCATTGCTAT TTGTATCACA 21960 

GTTAGTTTTT CATTTTAATA GTGGTATCAA AGC TGAAAGA GAACCTGTAC TTTCTTATGT 22020 
GTATCAGTCA TACCAACACA GTCATTTCCC AAATTTTGGT GGCGGTGTAT TAGGCTTTTA 22080

TTTATTAGAG TTAAGCGTAC CTTTAATTTC ATTATTTGGT GTATGTATTA TTACTATTTT 22140

ATTATTATGC TCAAGTGTTA TTTTATTAAC AAACCATCAA CATCGTGAAG TTGCAAAAGT 22200

TGCACTGGAA AATATAAAAG CTTGGTTTGG TTCATTTAAT GAA 22243



(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

TTATTAATnA TTAATATTTT TATTTTTAAA AATAAAGCGA GGAGCTATCA ATGGAACAAA 60 

TTACTTCTGC ACAAAATAAT AGAATTAAAC AAGCGAACAA GCTAAAAmAG AAACGTGAGA 12 0 

GGGATAAAAC TGGATTAGCT TTAATTGAAG GTGTGCATTT AATTGAAGAA GCTTATCAAA 180 

GTGGAATTGT AATTACACAA TTATTTGCAA TTGAACCGGC AAGATTAGATCAGCAAATTA 24 0 

wCGCATACGC GCAAGAAGTT TTTgAAATAA ACATGAAAGT TGCTGAATCT TTATCAGGTA 300 

CAGTGACACC ACAAGGGTTT TTCGCAATCA TTGAGAAGCC GCATTATGAT ATTTCTAAAG 360 

CACAACAAGT ATTGCTCATC GATCGTGTTC AAGATCCTGG AAAT T TAG GC ACATTAATTA 42 0 

GAACTGCGGA TGCTGCTGGA ATGGATGCTG TAATAATGGA GAAGGGTACG AC AG AT C C T T 4 80 

ATCAAGATAA AGTGTTGCGA GCGAGTCAAG GTAGTGTTTT CCATTTGCCA GTTATGACAC 54 0 

AAGATCTCGA TACGTTTATT ACTCAATTTA ATGGTCCTGT TTATGGTACA GCACTTGAAA 60 0 

ACGCAGTGgC ATACAAAGAA GTTACTTCAA GTSATTCTTT TGCATTACTA TTAGGTAATG 660 

AGGGAGAAGG TGTTAATCCT GAATTATTAG C AC AT ACT AC ACAAAATTTA ATCATACCTA 72 0 

TTTATGGTAA AGCTGAAAGT TTAAATGTAG CGATTGCAGG TAGTATTTTA CTTTATCATT 78 0 

TGAAAGGTTG ACCGTGTTGA AAGTTTTCCG AT AT AAT TAT AATTAATTGT TTAACAGfflC 84 0 

ATTTCTACGT GAATGCATAC AC AAT T G AAT ATAAACCAAT AAAAAGGCAT GGACATTTAT 900 

ATAAATAATT GTTTTAGGGA G AAT AAT C G T GACTGCAAGT TATTCCAATT ATTTAAAGTC 960 

TTTTCACCTT TTTGGTTACT TAAAGAGATT TAAGTCGGAA AGACAATCCG T TAT C AAT AT 102 0 

TAAACAAGTG TATGCTTAGG CATAAATTTG GGTGGTACCA CGGAAATGAC TTTCGTCCCT 108 0 

TATTTTTTAA GAGGATGAAA GTCTTTTTTT AGTTAAACAA CAAATATGAT AAATAGAAAA 114 0 

TGAATAGTTC GAATAGGGAG GTCAGTGACA TATGTCTGAA CAACAAACAA TGTCAGAGTT 12 0 0 

AAAACAACAA GCGCTTGTAG AT AT T AAT G A AGCAAATGAT GAACGTGCAC TGCAAGAAGT 12 60 
TAAAGTGAAA TACTTAGGTA AAAAAGGGTC AGTTAGCGGA C T AATGAAAT TGATGAAGGA 1320

TTTGCCGAAT GAAGATAAAC CTGCGTTTGG TCAAAAAGTG AATGAATTGC GTCAAACAAT 138 0 

TCAAAATGAA TTAGATGAAA GACAACAGAT GTTAGTTAAA GAAAAATTAA ATAAGCcAAT 1440

TGGcTGAAGA AACAATTGAT GTATCATTAC CAGGTCGTCA TATTGAAATC GGTTCAAAGC 1500 

ATCCATTAAC ACGTACAATA GAAGAAATTG AAGACTTATT CTTAGGTTTA GG T TAT G AAA 15 60 

TTGTGAATGG ATATGAAGTT GAACAAGATC ATTATAACTT CGAAATGCTG AATTTACCTA 162 0 

AATCACACCC TGCACGTGAT STGCAAGATA GTTTCTATAT TACGGATGAA ATTTTATTAC 168 0 

GTACGCATAC ATCACCAGTG CAGGCACGTa CGATGGAATC ACGTCATGGT CAAGGTCCAG 17 4 0 

TTAAAATTAT TTGCCCTGGT AAAGTGTATC GTCGTGACTC TGATGATGCG ACACATAGTC 1800 

ATCAATTTAC ACAAATCGAA GGATTAGTTG TTGATAAAAA CGTTAAATG AGTGATTTGA 18 60 

AAGGTACTTT AGAATTGTTA GCTAAGAAAT TATTTGGTGC TGATCGTGAA ATTCGTTTAC 1920 

GTCCAAGTTA CTTCCCATTC ACTGAACCTT CTGTAGAAGT TGATGTGTCA TGTTTTAAAT 1980 

GTAAAGGAAA AGGTTGTAAT GTGTGTAAAC ACACAGGATG GATTGAAATT TTAGGTGCTG 204 0 

GAATGGTACA TCCTAATGTA TTAGAAATGG CTGGTTTTGA TTCTTCAGAG TACTCTGGAT 2100 

TTGCATTTGG TATGGGACCA GACCGTATTG CAATGTTGAA ATATGGTATA GAAGATATTC 2160 

GTCATTTCTA TACTAATGAT GTGAGATTTT TAGATCAATT TAAAGCGGTA GAAGATAGAG 222 0 

GTGACATGTA ATGTTGATAT CAAATGAAE GTTGAAAGAA TATGTAACAA TCGAT GATTC 2 28 0 

TGTAAGTAAT TTGGCAGAAC GTATTACGCG CACAGGTATT GAAGTGGATG ATTTAATTGA 23 4 0 

CTACACAAAA GATATCAAAA ATTTAGTTGT CGGCTTCGTT AAGTCAAAAG AGAAACATCC 24 00 

TGATGCTGAT AAATTAAATG TTTGCCAAGT TGATATCGGA G AAGAC G AAC CTGHCAAAT 2 4 60 

CGTTTGTGGT GCACCGAACG TTGaTGCAGG ACAATATGTC ATTGTTGCTA AAGTAGGTGG 2520 

CAGATTGCCT GGTGGTATTA AAATTAAGCG TGCCAAATTA CGCGGTGAAC GTTCAGAAGG 25 80 

TATGATTTGT TCGTTACAAG AAATTGGTAT TTCAAGTAAC TATATACCGA AAAGTTTTGA 264 0 

ATCAGGCATT TATGTATTTA GTGAAGCCCA AGTTCCAGGA ACAGATGCCT TACAAGCTTT 27 00 

ATATTTAGAT GATCAAGTAA TGGAATTTGA TTTAACGCCG AATCGTGCAG ATGCTTTAAG 27 60 

TATGATAGGT ACTGCTTATG AAGTTGCAGC ATTATATAAT AC AAAAAT G A C T AAGCCAGA 28 20 

G AC AAC AT C A AATGAGCTTG ATTTATCTGC AAATGiTGAA CTGACTGTGA CAATTGAAAA 2 880 

T GAAG AT AAA GTACCATATT ATAGTGCACG TGTTGTTCAC GACGTGACAA TTGAACCCTC 294 0 

GCCAATTTGG ATGCAAGCAC GCTTAATAAA AGCGGGTATA CGTCCTATTA ATAATGTTGT 3000 

TGACATTTCA AATTATGTGT TATTAGAATA CGGTCAACCA TTGCACATGT TTGATCAAGA 3060 
TGCGATTGGT T C AC AAC AAA TTGTTGTTCG TCAAGCTAAT GAAGGCGAAA AAATGACAAC 3120

ATTAGATGAT AC AG AAC G T G AATTATTAAC GAGCGATATT GTCATTACTA ATGGACAAAC 3180 

TCCAATTGCA TTAGCTGGTG TTATGGGTGG CGATTTTTCA GAAGTTAAAG AAC AAAC AT C 32 4 0 

AAATATAGTG ATTGAAGGTG CTATTTTTGA TCCAGTTTCA ATTCGTCATA CAT CAAGACG 3300 

TTTAAATTTA CGCAGTGAAT CATCTAGTCG TTTTGAAAAA GGAATAGCTA CTGAATTTGT 33 60 

AGATGAAGCA GTCGACCGTG CATGTTATTT ATTACAAACT TATGCAAACG GAAAAGTGCT 34 2 0 

AAAAGATAGA GTGTCTTCAG GAG AAC T T G G TGCATTTATT ACECAATCG ACATCACTGC 34 8 0 

TGATAAAATT AATCGCACTA TTGGATTTGA TTTGTCACAA AATGATATTG TTACTATTTT 35 4 0 

TAATCAACTA GGGTTTGATA CAGAAATAAA TGATGATGTT ATTACAGTGC TAGTACCATC 36 00 

ACGTCGTAAA GAT AT T AC AA T T AAAGAAG A TTTAATTGAA GAAGTTGCAC GTAT AT AT GG 3 6 60 

ATACGACGAT ATTCCATCAA CGTTACCTGT CTTCGATAAA GTTACTAGTG GTCAGCTAAC 37 2 0 

TGATCGCCAA TATAAAACTA GAATGGTTAA AGAAGTGTTA GAAGGTGCTG GATTAGACCa 37 8 0 

AGCTATTACG TATTCGTTAG TTTCTAAAGA AGATGCTACT GCaTTTTCGA T G C AAC AG C G 38 4 0 

TCAAACAATT GATTTATTGA TGCC/ATGAG TGAAGCGCAT GCGTCATTAC GTCAAAGTTT 3900 

ATTACCACAT TTAATCGAAG CGGCATCATA TAATGTGGCA CGCAAAAATA AAGATGTAAA 39 60 

AT TAT T T G AA AT CGGCAATG TCTTCTTTGC TAATGGAGAA GGTGAACTAC CAGATCAAGT 4 02 0 

TGAATATTTA AGTGGTATTT TAACTGGAGA TTATGTAGTC AATCAATGGCAAGGTAAGAA 4 08 0 

AGAAACGGTT GATTTCTATT TAGCAAAAGG TGTCGTGGAT CGAGTATCTG AAAAGTTAAA 414 0 

TCTTGAATTT AGTTATCGCC GTGCTGATAT TGaTGGATTA CATCCAGGTC GTACTGCTGA 4 2 00 

AATCTTATTA GAGAATAAAG TTGTTGGTTT TATTGGTGAA TTACATCCAA TATTAGCAGC 4 2 60 

TGATAATGAT TTAAAACGTA CGTATGTTTT TGAGTTGAAT TTTGATGCAT TAATGGCTGT 4 32 0 

GTCGGTAGGT TACATTAATT ACCAGCCAAT TCCGAGATTC CCAGGCATGT CTCGTGACAT 4 38 0 

TGCATTAGAA GTAGATCAAA ATATTCCAGC AGCTGATTTA TTATCAACGA TTCATGCACA 4 4 40 

TGGTGGCAAT AT AT T AAAAG ATACACTTGT CUTGATGTA TATCAGGGCG AACATTTAGA 4 500 

AAAAGGTAAA AAATCAATTG CAATACGTTT AAATTATTTA GACACAGAAG AAAC AT T G AC 4 5 60 

AGATGAGCGC GTTTCAAAAG TACAAGCGGA AATTGAAGCA GCATTAATTG AACAAGGTGC 4 620 

TGTTATTAGA TAATGATTTA AACCCCATGT ATAAGGATAT CTGAAGTAGA TTGATATCC 4 680 

TAACATGGGG TTTTATTTTT GGGTTCACCA ATTTGGTTCC AATGCATTTA AAAAG TC AAA 47 40 

GAG G AAC AG C GGAATACAGA TGATGcTTCG CACAACTGCA TAAAAGCCTC TAATGATTAA 4 8 00 

AAATCAAAGA GGCTTTAAAA TTTTTTGGGC TTTTTCACGA TTTTTAAAAT GCTTTTTTGA 4 8 60 

AATGGTATCT AAAGGTGAAA GACCGTATTT TTTTATAATT TTGGCGGCGA T T AC AT C G AC 4 920 
TTTAGCACCG GCACCTTTAG GAATCGTCAT ATTAATATTT TTTGATATTT GATCCATATA 4980

TGTAACAAAT GCGTATCGAG AAATTATGCT TGCCACTGCA ATGGCTAATG ACTTCGATTC 5 04 0 

TCCTTTTGTT TCAAATTTTG TTTTCTTTGG AAGTGGTATATCTGATAATG CGTAATGGCT 5100 

ATACACTTCG CGTTTTGCGA ACTGATCAAT GACGATATAG TCTAATTGAG ACGAATCAAT 5160 

TTTTTCAAGT ACATTTTTGA TGGCTTCATT ATGAAGGGCA GCTTTCATTT TTACTTGAGT 5220 

CCAGCCTTTT GCTTGCTGAA TATTATATTT TTCATTGTGT AGTGTTAATA ATGAATGTGG 5280

TATGAAAGTA ACCAATTGCT CAGCAAGTTC TACAATTTTG GTATCGGTTA ATTTTTTTGA 5340 

ATCATCTACA CCCAAAGTTT TTAAAATAGG GACATGCTCT TTGGTAACGA AAGCAGCACA 5 4 00 

CACAGTCAAC GGACCAAAGT AATCGCCACT TCCAGCCTCA T C AC T AC C AA T AC AG T T AAA 54 60 

TTGrTCATAC ATTAaAGTTg TcCAgAAAAG AATTAGCCAT ATTTnCCTTT 5510 
(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

GnTTATACTT ATAAATTTTA CGGGGGTAAT ATAATACTtA TTTACCTGTA ATATATGATA 60 

ATTCTTCAGC GGCAGCTGCG TTGATAGTTC TAT GAG AAA T GATACCTAAT CCTTTAACAT 12 0 

TGGATTCTGA AATAACGATA GAACCATCAC TGTTAACTTT TEAACAAAT GCTACATGAC 180 

CGTAATGTTG ATCTGCACCA AATTGTCCAG CCTCAAATAC AACAGCAGCA TGACGTTTTG 24 0 

GTGTATGACT T AC T T G AT AA TCACGGTATT GAGCTCGATT ATTCCAATTA TGTGCATCAC 300 

CTAAATCACC T GAG AT AG AT GTACCAAATT GTTTCATACG GT TAT AT AC G TACCAAGTAC 360 

ATTGGCCATG TGGATATGGC ATACTATCAG ATACCTCACG GAAAGGTTTG AATTCATCTG 4 20 

AT GAAT CAT C ATAATCCTTG ATAGAACGTT CATATTTATC TAAATCTGGC ATGCGTTCAT 4 80 

CGTCAAACTG AGTTAATTGA TAGTGTTTAA TAATACTGTT TAATTTCTTA GCATAGTTTG 54 0 

GATCTGTAGC ATATGTTTTA GATMGTGTG ATGTTGCATC TTTATAAGAA TCGGCTTCCG 600 

ATTTCCATGT TGGTTTATAA ATTGTTCGAT TGCCATCAAT ACCATTTTTA AT AAGGT CAG 660 

AGTAATCTTT TAGTGATTCT TTCGTGCTTG GATATTTTCG GAATCCAGCA TTAATACTAT 720 

ACAATTGATT ACCATCAGCT TCTAATGTGT TAAAAGGAAC AGAATTCCCTTCaAAAGCAC 7 80 

CTTTGATACC GAATAAATTA TGGTTTGGTG ACwTAGCTAA AGCACTACGA CCTGAGTCAG 84 0 
ATTCTAAGAT TGCTTGGGCA ATCATGACAG ACGCATAAAT ATCGTTATCT TGACCAATGC 900

GATGTGCATC TTTAGCAATT GATTTGACAA ATTGACGTGT ATCTTTTGAG TCAACAACGT 9 60 

TAAATTGTCC GCTATCATCA TTGTTAGATA TACTAGGATC TGTTTCGAAT AATGATGTTG 1020 

CACGTGTATC CTTTTGATTA ACATCGTTAT TGAATGATTG AGCAGGTTTA GATTTATGTT 1080 

TCAATTCATC TTGTGTTGGT AACTGTGGAT TCTTTGTATT AGATTTTTCA TTTTTGTCTT 114 0 

TTTTAGATTG AGATGCATAA TCTTTTTGTG TTTTCTTTGC ATCTTCACTG TATTGATCCA 1200 

AAATAGAGTC TAAAGCCGAA TCTGACATTG ATTGATTATC TTTCGATGAA GATTTTTGAT 12 60 

TTGCTTTATC GTCACTTGCT GGTTGACTAT TTGATTGATT AGGTTGTGTT GGCTTTGGCG 1320 

AATTTGGTTG CTTATTAGAT GTACTTGGTT TTGTATTGTT TGATTTAGGT GCTTTTGAT 138 0 

TGTCTGCTTT ATCTTGTTTA GATGATTGCG TATCAGTGTC ATTTTTGATG CTATTGTCAC 14 40 

TGTTTTTATT CGAATCATTT GTTGACTTTT CGCCATTACG AGGTTGTTCG TAATCAGAAA 1500 

TATCCGAATT TAAATTGAAT AAGTTTTGGA TTAAAGTTGT TAATGAGTAA TTATCATCGT 15 60 

ATTTATTTTT GGTTAGCAAT TGGTTTATAT TGGTTTGTGG TAAATTCTTA TAAATAAAAT 1620 

CAATGATATT GTTAGAGTCT GAAGTGCTGT CGTCTATAGT TTTAAATTTT TTGTCGTTAT 168 0 

TGTCTTGGTT ACTTGTATTA TTTTTGTCTG CTTTATCAAT ATCTTTACTT GTAGTATCCT 1740 

TAGAAGTTTC ATCGTCATTA GATTTTTTTG AATCATGAIA TGTTGTCTTA GCTGTAGTAT 18 00 

CTTTTTGAGG TGTATCAGCA TAAGCGgTAG GTGAAaCTAA AGTAGGTAAT ACGAGCGTAG 18 60 

TTGATAGCAA ATAAATTAAA ATTTTATTTT TAGGCATATT TCGTATTCTC CCTTGAAAAA 192 0 

TATAATAATT AAGTGTGATA ATAAACTATG ATTTGTTATA ATTTATCGTA TGCTGAAAAT 1980 

AGTTGATAGG TATCAATCGA CTAAATATCT TCCAGTAAAT T GAT TAT AC T AATTCACAAC 2 04 0 

GCAAAAATAA ATTAATTTAC AAAAAATATA TAAAAAATAT GAATAATTCC TACATAGGAG 2100 

TGTGACAATG AAGAACGCAT TTAAATTATT TAAAATGGAT CTGAAGAAAG TAGCTAAGAC 2160 

GCCAGCTGTG TGGATTATCT TAGCAGGCTT AGCTATTTTG CCATCGTTCT ACGCTTGGTT 2220 

TAACTTATGG GCAATGTGGG ATCCATATGG CAACACGGGA CACATCAAGG TCGCAGTCGT 228 0 

TAATGAAGAT AAAGGCGACA CAATCAGAGG GAAAAAAGTT AATGTCGGTA ATACGATGGT 2 34 0 

TAATACACTC AAGAAAAATA AAAGTTTTGA TTGGCAGTTT GTAAGKGAG AGAAAGCTGA 24 00 

TCATGAGATA AAAATGGGTA AATATTTTGC AGGTATTTAC ATCCCATCTA AGTTTACACA 2 4 60 

TGAAATTACA GGGACACTAC GTAAGCAGCC T C AAAAAGC A GATGTAGAAT TTAAGGTGAA 2 52 0 

TCAGAAGATT AACGCTGTTG CGTCTAAGCT AACAGATACT GGTTCGTCAG TTGTCGTTGA 2 580 

AAAAGCGAAT GAACAATTTA AT AAAAC AG T AACTCGAGCA TTATTAGAAG AAGCTAACAA 2 64 0 

AGCAGGTTTA ACTATTGAAG AAAATGTGCC GACAATTAAC AAGATAAAAA ATGCGGTATA 2 7 00 
TTCAGCAGAT AAAGCTTTAC CTAAGATTAA TGACTTTGCG AATAAAATTG TATATTTGAA 2760

TAACCACCAA GCGGATTTAG ATAAATATSC CAATGATTTT AG AAAAC TAG GAAATTATAA 2 82 0 

AGGTGATATT TTAGATGCTC AGAAAAAATT AAACGAaGTC AATGGTGCTA TTCCGCAACT 28 80 

TAATGAAAAG GCTAAGTTGA TATTAGCTTT AAATAATTAT ATGCCGAAAA TTGAAAAAGC 2 94 0 

GTTAAATTTT GCAGCTGATG ACGTGCCAGC GCAGTTCCCT AAAATTAATC AAGACTTAA 3000 

CATTGCGAGT CAAGGTATTG ATCAAGCTAA TGGACAGTTA AATGAT GCCA AAGGCTTCGT 3060 

CACACAAGTT AGAAGTAGAG TCGGTGATTA TCAAGATGCA ATTCGACGCG CGCAAGATTT 3120 

AAATCGAAGA AACCAGCAAC AGATTCCTCA AAATAGCGCG GCGAACAACG AAACATCAAA 318 0 

TAGTGCACCT GCAGCTGGTA ATGGTGTAGC ATCAACGCCA CCAAGTGCAC CAAGTGGCGA 32 4 0 

TACTGCACCA AATAATAATG TTACGCAAAA TACCGCACCA AATAGTAATA ATGCGCCTGT 3300 

ATCGACTACA CCACAAAGTA CAAGCGGGAA AAAAGATGGT CAAAGTTTTG TAGATATAAC 3360

AACAACACAA GTCAGCACAG C T AAC GAG AA C AC A (AAAAC ATTACAGATA AAGATGTTAA 34 20 

ATCAATGGAA GCGGCATTAA CGGGCTCTTT ATTATCATTA TCAAATAATT TAGATACCCA 34 8 0 

AGCGAAAGCC GCACAAAAAG ATAGTCAGGC ATTACGTAAT ATTTCGTATG GGATTTTAGC 354 0 

ATCGGACAAG CCTTCTGATT T TAG AG AG T C TTTAGATAAT GTTAAGTCCG GTTTAGAATA 3 60 0 

CACAACGCAA TATAATCAAC AATTTATCGA TACATTAAAA GAGATTGAGA AGAATGAAAA 3660 

TGTTGATTTA TCAAAAGAAA TTGATAAGGT AAAAGCAGCT AATAATCGAA TTAATGAATC 37 2 0 

ATTAAGGTTA GTTAATCAAT TAAGCAATGC AT T AAAGAAT GGTAGTTCAG GAACTGCTGA 37 8 0 

AGCTACTAAA TTACTAGATC AACTTTCAAA ACTAGATTCA TCATTATCAT CATTTAGAGA 38 4 0 

TTATGTTAAA AAAGATCTTA ACAGCTCTTT AGTATCAATA TCACAACGTA TTATGGATGA 3900 

ATT G AAC AAA GGGCAAACTG CATTATCCAA TGTTCAGTCT AAATTAAATA CAATTGATCA 3 9 60 

AG T CAT T AAC AGTGGACAAG CTATTTTAAA AAATGGTAAA ASCGTATTG ATCGCTTACA 4 020 

AACAGTATTA CCAAGTATTG AACAACAATA CATTAGTGCT GTTAAAAATG CTCAAGCAAA 4 08 0 

CTTCTCGAAA GTGAAAAGTG ATGTAGcTAA AGCTGCTAAC TTTGTGCGCA ATGACTTACC 414 0 

ACAGTTAGAA CAGCGAT T AA CTAATGCGAC AGCAAGTGTG AATAAAAATT TACCAACGTT 4 2 00 

ATTAAATGGT TATGATCAAG CGGTAGGATT AC T AAAT AAA AATCAGCCAC AAGCGAAAAA 4 2 60 

GGCTTTATCA GATTTAGCTG ATTTTTCTCA AAATAAATTG CCTGATGTTG AAAAAGATTT 4 32 0 

GAAAAAAGCG AATAAAATTT TCAAGAAATT AGACAAAGAT GATGCAGTCG ACAAATTAAT 4 38 0 

CGACACACTT AAGAATGATT TGAMAAGCA AGCGGGTATT ATTGCAAATC CTATTAATAA 4 4 40 

GAAGACTGTT GATGTTTTCC CAGTTAAGGA TTATGGTTCA GGTATGACAC CATTCTATAC 4 500 
TGCACTGTCA GTATGGGTAG GTGCACTCTT GATGGTAAGT TTATTAACGG TTGATAATAA 4560

AC AT AAG AG T CTAGAGTCAG TCTTAACGAC AAGACAAGTG TTCTTAGGTAAGGCAGGATT 4 620 

CTTTATAATG CTTGGTATGT T G C AAG C AC T CATTGTATCG GTTGGAGATT TGTTAATCCT 4 680 

AAAAGCAGGA GTTGAGTCAC CTGTATTATT TGTACTTATA ACGATTTTCT GTTCGATTAT 474 0 

TTTCAACTCA ATCGTATATA CGTGCGTATC ATTACTTGGT AACCCAGGTA AAGCCATTGC 4 800 

AATCGTATTG CTTGTATTAC AAATTGCAGG TGGTGGGGGA ACATTCCCAA TTCAAACTAC 4 8 60 

GCCACAATTT TTCCAAAACA TTTCGCCATA CTTACCATTT ACGTATGCAA TTGATTCATT 4 920 

ACGTGAAACA GTAGGCGGTA TTGTTCCGGA AATCCTAATT ACAAAATTAA TTATATTAAC 4 980 

GTTATTTGGT ATAGGATTCT TCGTTGTAGG TTTAATTTTA AAACCTGTAA CAGATCCATT 50 4 0 

GATGAAGCGC GTATCTGAAA AAGTTGACCA AAGTAACGTT ACAGAATAAA AATTAAATCC 5100 

ACACATTAGG GTTATAGCTC CTTAATGTGT GGATTTTTAT GTTTTTAGAC AGAAGAGATA 5160 

GTAATTTCTG TCTTTTATGG GACGGTTGTT ATCATTGCTA TTATCCAGGA TGACTTSTA 5220 

TAGCACTAAT ATTACCGACA AAGTGAATAT CCTCGTCTTC CGTAGTTAAA ATAAAGC TAG 5280 

AACCTTTTTG GATGTCATAG TGCTTATCGT TTACTGTTAA AGTACCAGTA CCATCGATAA 534 0 

TTGTAACTAA GCAATAAGCA TGTGGTTTAT TGAATTTTAA ATCTCCATGA ATATCCCATT 54 00 

TATATACTGC AAAATATTGA TTATCTACAA ATTGAGTTAC AGTGTGTGTG TCGATGTGAG 5 4 60 

TTGTTATAGG AGTAGTATTT GGTTCATGAT TGCCTAATTC AATCACATCT TTACTTTGCT 5520 

CTAAGTGCAA ATCACGCAAT TGACCATTTT GATCTCGTCT ATCATAGTCA TAAATACGGT 558 0 

ATGTCGTATC GGAGGATTGT TGTGTCTCTA AAATTAAAA' ACCCGAACCA ATGGCATGGA 5 64 0 

CAGTGCCAGC AGGAACATAA TAAAAGTCAC CGGGCTTAAC AGGTATACGT TTGAAAAGAC 57 00 

TGTCAAATTC ATGATTATCA ATCATGTCTA TTAACGTCTG TTTATTATGT GCATGTACGC 57 60 

CATATATAAT TTCAGCACCT GGGGCTGCAT C T AAAAT AT A CCAACATTCT GTTTTACCTA 5820 

GTTCGCCTTC GTGTTTTAAA GCGTAGTCAT CATCTGGGTG AACTTGAACA GATAATTTAT 58 8 0 

CAT T G GC AT C TAATACTTTA GTTAGCAGAG GGAAACTATC TCGTGAATCA TTATCGAATA 5 94 0 

ATTCACGATG TTGTGACCAA AGTTGATCTA GGGTCATATC CTTGTATGGA CCATTGATAA 6000 

TTGTATTAGG ACCATTTGGA TGTGCAGAAA TTGCCCAGCA TTCACCAGTT GTTTCATTAG 6060 

GGATATCATA GTTAAATGCT TTTAATGCAT GACCGCCCCA AATTCTGTCT TTAAAAACGG 6120 

GTTGTAAAAA TAATGCCATA GTTAAAACTC CTCTATATTT TCATTAATAA GTTATAAATT 618 0 

TCTGTAGTAC TGTTTGCATT AATTAGTGAT TGGCGTGTCT CATCATCAT TAACGCTTTA 6240

GATAAGCGCT G AAG TAT T T T TAAATGTGTA TCCTGACTGT TGTTTGGTAC GGCAATTAAG 6300 

AATATCAATT GAGGTAGACT ACCATCTAGA CTGTCCCATT TAACACCATG ATTATTTTTC 63 60 






ATAACAGCTA CAATCGGTTG TTTTACAACA TCAGACTTTG CATGTGGAAT GGCCACGTTC 6420 

ATGCCAATAG CTGTCGTAGm tCcATTTCAC GTTCTAGTAT TGCATTTTTT AAATGCGATG 64 8 0 

TGTGCTCTAC ATAACGGCAA ATTTTAAGTT TATGAATCAA CATATCAATT GCTTCGTTTC 654 0 

GAGACATGTC GTGATCAGTA ATTATCATAG TTTGTTGATC AAAAAC AT G A GAAGGTTTAT 6600 

TGAGATGT G A ATGTTTCGCG GTGTTATCTA CATTGTCAAC CTCTGTATCA TGTTGTGTAA 6 6 60 

TATCTGTATC ATGAAGTTGC GTGTGTTGCG CTGGTGCATC TACTGCTATA ACTGGTGTAT 67 20 

TGCGTTTTAA TAATAGTACA GTAGTCATTG TGACAAGACT ACCTACTATC ACTGCAAAGA 67 80 

TAAACCATAA TACATGATCA ATACCACCTA ATACAGCCAC GATTGGACCT CCAGTGCGA 68 4 0 

CTCTATCGCC GACACCACCA ATGGCTGCAA TGACTGATGC AATCATTGCA CCAATGATGT 6900 

TTGCAGGTAT AATGCGCAAT GGATCTTGGG CTGCGAAAGG AATAGCACCT TCAGTAATAC 6960 

CAAATAGTCC CATAGTGAAG GAAGCCTTAC CCATTTCTCT TTCGGAATGA TTGAATTTAT 7 020 

ACTTTTGAAC AAACGTTGCT AAACCTAAAC CGATTGGTGG TGTACATACA GCAACTGCGA 7080 

CCATACCCAT AACGGCGTAA TTACCTTCAG CAATAAGTGC TGAGCCAAAT AAAAATGCTA 714 0 

CCTTGTTTAC TGGACCGCCC ATATCGAAGG CAATCATCGC ACCTATAATC ATCGCAAGTA 7 2 00 

TAATAATATT AGCACCTTGC ATACTTTTTA ACCAGTTGT TAATGCCTCA AAAAT AT TAG 7 2 60 

AAATTGGTGC ACCGATTAAA AATATAAATA TCAATCCTAC AACGACCGAT GAAATAATGG 7 32 0 

GAATAATAAT GATAGGCATA ATTGGTGCCA TTGCTTTTGG AACTTTAATA TCTTTAATCC 7380 

ACTTTGCGAT ATAACCTGCT AAGAAACCAG CAACAATACC ACCTAAAAAT CCTGCGCCTG 744 0 

CATCACTGCC ATAAAAACTA CCGTCAGCAG CGATAGCGCC GCCAATCATA CCAGGAACAA 7 500 

GACCGGGcTT GTCAGCGATA CTAACAGCGA TATATCCAGC TAGTATTGGA ACCATAAATT 7 5 60 

TAAAGGCTAA ACTACCAATG TTTTCAATGG ATTTCCAAAA TGAATCATCT GGGAT G AC T A 7 62 0 

ATCCTTTTGA TGTCGTTtCA CCGCCTAGAG TCAGCGCGAT GGCGATAAGG AGTCCACCAA 7 68 0 

CTACGATAAA AGGAACCATA AACGATACAC CGTTCATTAA ATGTTGATAC ACCATTTGAA 77 4 0 

TACCATTTTT AGACTTACCG CGATCTTTCG AATGATAATT TGTTTCAGAT T GAT AAAT AG 7800 

GCGCATCTTG ATTAATGATA CGTTGAATTA GACCTCTCGG ATATGAATC CCTTCGCGAA 7 8 60 

CATTTTCATT AATCAACCGT TTACCAACAA ATCGGGACAG ATCAACTTGT TTATCAGCTG 7 92 0 

CAAT TATGAC ACCGTCAGCT TCTTCGATGT CTTGCGTAGT TAAAACATTT TCAGCACCAA 7980 

CACCGCCCTG TGTCTCTACT TTAATATCCA CACCCATTTC TTTTGCTACC TGCTCAAGCT 804 0 

TTTCTTGAGC CATATATGTA TGTGCAATGC CATTTGGGCA TGAGGTAATA GCTACAATTT 8100 

T CAT AAAAT C ATCTCCTTTT CTATATTGTA AGCGTATTCT CGATACTAAA AAAAAGAATA 8160 
ATTACCGTTA CTAGTGGCAA TTATTCTTGT AAGTATTCAA ATAACTGTTG CTTTAAACTA 8220

TGATCATCTA AACTACATAA ATGGTTCACT GAATCATCAT CCAAGTTAGC AATTAATTGC 82 8 0 

ATCATTTGTT TTGTAAAAGC TTTGTCTTTA TGCGAAATCG CTAAGAAAAA GACAAGTTTG 8340 

ACATCGTGTT GTCGCCAAGG AAAAAC AT C T TTTGTGCGAA AAATAAGCAC ATGTGATTGT 8 4 00 

AAAACTTTTT CAGGATCTCC ATGAGGAATC GCCATAAAAT TACCTATGTATGTAGAAGAT 8 4 60 

GATTTCTCAC GCTCTAAAGC T GAT T C GAT A TATCCTTCTA CAATCGCATG ATGTGCTTGT 8 52 0 

AATATTTTTT GAGCTTCTTC AAAAATTTGC ACAGTATGCC GTGATTTTTG TTCAGTATTT 8 58 0 

AC G AC AAG G A AATTGACAGT GTCCATATGA TGATGTGCTT GAACCGGATT TTGCTTTTGC 8 64 0 

TTCACAACGT GTCTGATTTT GTGACGATCA TCTTCAGAAA ATAATGGTGC AACCTTGATA 8700 

GTCGTCAGGT GCTTAGGAAG TATGTTTAGC GTTTGTTTAG GAATATCATG GGTCGTTATT 8 7 60 

AATAAATCTA CATTGTCAAA GTGATAGTGT GTTATATTTT CTAGTTTAAT CGTATTTATC 8820 

ACTGACAACT CTTCGGATAA GTTATTTATT TTAGTTTCTA AAAAATTCGA CACACCTAGA 88 8 0 

CCATAATAAC AAGCAATGAC TACATTTAAT TGTGTTTTGG TACGACGCTC GATGGCAGCT 8 94 0 

TGAAAATGAA TTGTTAAAAA TGCAATTTCA TCTTCGCTCA TCTCTATATC AGTATCAATT 9000 

GCTAATTTAT CAATCGCTTC AAAAAGTGTG TTAAACACAA AGGGATAGAG TTTTTT&TC 9060 

TCTATAACTA AAGGATTGTT TAAATAAATG TTTTGAGTGA TACGTAAATA TGCTTTACTA 9120 

AAAT GAT TAT ATAAATTTTG TTGTAAAATC GAATCTTCAT TGAAAGGTAC ATGAATACGT 9180 

TGCTGCATCA ATTCGATTAA GCGATCAATA TAACTTTGTA TAAATATACG TTCTATGCCA 92 4 0 

ATATCGAGTT TATTAAAATG ATAAGCAATA AAG AAT G AAA ACATATTGAT TACTTTTTCG 9300 

TTCAAGTCAT AACCTAATCT TTCGTTGATT TGCTTAATGC AAG AT T GAGA TATCAATTTT 93 60 

GAATCGTGGT TTAACGTGTC TAGCTCACTT AAAGATyGCT TACGTTTAAT GATTAAAATG 94 2 0 

AGATGAATTA AAAGCTGTTG TATTTGAATA TCAGTTGTT C AAT AC TAT G TTGTTGAAGT 94 80 

GTCTCTTGTA TAATATGCGA AATCATCCTT TGGTGTGAAT CAGGTAATTC aTTTAAAATT 954 0 

AGGTCTTCAA CAT G T AC AT G CCCTGATGAT AATTGATTTA AATGGATGAT GGCATTAGTG 9600 

ATATCATTAT CTGTTCCATC GAC 9 62 3 
(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:
ACCGTGGAAA CACGTCTAGT CAATCAGAAA GCGATAAAAA TGTGACTAAA TCATCTCAAG 60

AGGAAAATCA AGCAAAAGAA GAATTACAAA GCGTTTTAAA CAAAATTAAC AAACAATCAA 120 

GTAAGAATAA TTAAAAAATT TTGATATTGT CTATGTTTAT AGTTCACAAG CCATTCAACG 180 

TATTGTAAAC TAAGGATAGT GTATTTTTTT AATAGTAATT TGTCAGGAGG TGCCTATCTA 24 0 

TGGAAGAACA TTACTACGTA AGTATTGATA TTGGATCATC AAGCGTAAAA ACAATAGTAG 300 

GCGAGAAATT TCACAATGGT ATAAATGTGA TAGGTACAGG ACAAACCTAC ACGAGCGGTA 3 60 

TAAAAAATGG TTTAATTGAT GATTTTGATA TTGCGCGACA SCAATCAAA G AC AC AAT T A 420 

AAAAGGCATC AATCGCTTCG GGTGTTGATA TTAAAGAAGT TTTCCTGAAA TTACCTATCA 4 80 

TTGGAACGGA AGTTTATGAT GAATCAAATG AAATCGACTT TTATGAGGAT AC AG AAAT C A 54 0 

ACGGTTCACA TATCGAAAAA GTATTAGAAG G TAT TAG AG A AAAAAATGAT GTGCAAGAAA 600

CAGAAGTAAT TAATGTGTTC CCGATTCGTT TTATAGTCGA TAAAGAAAAT GAGGTTTCAG 660 

ACCCTAAAGA ATTAATTGCC AGACATTCAT TAAAGGTTGA AGCAGGCGTA ATTGCTATTC 720 

AAAAATCGAT TTTAATTAAT ATGAT T AAAT GCGTAGAAGC ATGTGGTGTT GATGTATTAG 780 

ATGTTTACTC TGATGCATAT AACTATGGTT CAATCCTAAC AGCTACTGAA AAAG AG T T AG 84 0 

GTGCATGTGT CATTGATATT GGTGAAGACG TTACGCAAGT TGCTTTTTAT GAACGCGGTG 900 

AAT TAG TAG A TGCTGATTCT ATCGAAATGG CAGGGCGTGA TAT T AC a G AC GATaTTGCAC 960 

aAGGrTTaAA CACTTCTnAT GAAACTGCTG n AAAAAG T T A AAC AC C AAffi TGGTCATGCA 1020 

T 1021 



(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7963 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

TAATCTATTA TAAAAACTGT CCATACCCTT TGATTACCTT CTCTTCAGGT ACAGGCCACA 60 

CTTGAGGCCA TAAGCCATAT GCTTGCTGTG AATAAAATTG TGCCATTTGT AACAATATAA 120 

TATATACAAA TAAACACCCA ATAATTGCTG TCACTAATGG ATATGATAAC CAAACCATTA 18 0 

ATAAAACTGC AATAATTACT AACCTAAAGA T AAT AT T AAA TGCGTCTCTC CCTCTTATAA 24 0 

AGCTTCTAAT AAATAAGAAT AAAT AC AT C G CAT T AGAGT T AAATTTACTA CCCTTTGGAA 300 

CTGGTAAAAG TATATCTAGA TAACTXTTC TGACTGCAGA TTCTTTCAAA TGTTTTACAT 360 
CGGTGAACAT AT TAAC AAAT TTATAATAAT TCATATGATG TCGATGTTCG ATTGCAATCA 420

TTTTCTCCCA AGGATACAAA AAGCCTGGTT TATATTTTTT AACTAAAAAT T C TAT TAAC A 4 80 

CAGGCAAAGC AACCATCACA AATGCGATGT ACCATTTTGG AGCTAATAGT AGTAATATG 54 0 

TTAGAGCAAA GGTGATGAAT GAT AT T AAAT TAACTTGCCA TGTTTTAAGT CCCGATTGAT 600 

ACCATTGCCA TCTTAAGCGT AAACCAACAT ATGGAAAAAT TAATGCACTG ACTCCAAAAC 660 

AAATATAAAA TGCCACATTA TGTTGATTAA TATTGTAAAA CAACGGGAAC ATTACAATAA 72 0 

CAATAATGAG TTGGATTAAT ATGCGCGCAA AG TAAC TATA TAAAATCGCA TGACGCATAA 780 

ATTGAGACAT GTGTTTTTCA AATGGTAATA AAAAGATTTT ATCCgCTTCT TTTAACAGTG 84 0 

GTCsCmTTGG AAAAATAGrT GTCAACGCAA CAATCACTGC TGCTATTaAT GAAAAATTGa 900 

TATTCGTTGG AATATGTTTT AACCATTcAC CAlVTCCArA AATAAATGCA CCCAGCAAAA 9 60 

TAAGTAAAAA G AC CAT G AAA TGACCATTAA ATATAAACTT ATTATAATAA TTTTtCTCTT 1020 

TACGAAGGGC ATGTAATCTT TT ATT AAAT A ATGTGGTAGC TTGGTTACGC ATGTACATCT 1080 

CCACCTTGCG TCACATGAAT ATATATATCG TCTAATGTTT GATTATGTAA GCCAGTTTE 114 0 

TGTCTCAATG CTTCTAAATC TCCAAATGCA ACGACTTCAC CTTCGTCTAG TATGaTAAAA 1200 

C GAT C AC AG T AACGTTCAGC TGTTGCTAAA ATATGTGTAC TCATTAGAAC GGTTCTACCT 12 60 

TCGTTTTTCT TTTCAACCAT TAAATCTAAC ATGGATTGAA TTCCTAATGG ATCTAGGCCA 132 0 

AGGAATGGTT CGTCTATAAT ATACAATTCG GGATTAACGA TAAACGCACA AATAATCATG 1380

ACTTTTTGTT TCATCCCCTT AGAAAAATGA CTCGGAAAAA CTTTCAACTC ATTTTCTAAA 144 0 

CGGAATGTCT TTAATAATGG CATTGCTCGA TTCATCGTTT CATCACGATC AATATCATAT 1500 

GCCATTGCTG TCATCTCAAT GTGTTCCTCT AATGTGAGTTCTTCATAAAT AACCGGTGAT 15 60 

TCCGGAATAT AAGATAACTT TCTTCTATAA GCCTCTATGT CATCATTAAT GTTGATATCT 1620 

GAAATTGATA GAGATCCTTC CATAGGTGTA AGCAATCCTA GCATATGTTT AATCGTTGTA 168 0 

CTCTTACCAG CGCCATTAAG GCCAATAAGT CCAACAATTT CGCCTTTGTT TAATTCAAAA 1740

TTTATATCTT TAATTACAGG GCGTTTTCCA TATCCACCTG TAAGCTGTTC TACTTTAACT 1800 

GTCATAAGGC ACCTCCATGA CTTATATTGT ACCAAAAATT ATAAAATGCT CAT AT T AAAT 18 60 

ACACATGTCC TAATATCGAA TTTTTAGCGA CAATGTTATA ATGAATGGTA ATACTAGTTG 192 0 

AAAAG GAG T G TAGTCATCAT GTCAGAAACA ATTTTCGGCA AAATTTTAAC TGGAGAAATT 198 0 

CCTAGCTTTA AAG TAT AT G A AGACGATTAT GTCTATGCCT TTTTAGATAT ATCACAAGTT 204 0 

ACTAAAGGAC AT AC G T TAT T AATTCCTAAA AAAGCTTCTG CTAATATCTT TGAAACTGAT 2100 

GAAGAAACAA TGAAACATAT CGGTGCAGCA TTACCTAAAG TAGCAAAGC TATTAAGCGT 2160 

GCATTTAATC CTGATGGTTT AAACATTATT CAAAATAATG GTGAGTTTGC AGATCAATCT 2220 
GTATTTCATA TTCATTTCCA CTTAATTCCT CGATACGAAA AT GAT AT T G A TGGATTTGGT 2280

TATAAGTGGG AAAC AC AT G A AGACATTTTA GATAACGATG CAAAACAACA AATTGCTGAA 23 4 0 

CAAATTCAAG CACAATTTTA AATGTATGCT TAATCTAAGC TCGAACGGGT ATAATATGAT 2 4 00 

TAATATTATA ACAATTGCGT TTGAAGTGAT AACATCAAGG TTAGCAATTT TAAACAAAAT 24 60 

GAGTTATCAA GATAACAGAT GTTAAAAGTG AGGAGAATAT AAATGAAAGC ATCACGCATT 2520 

cTATTCGGTA TCGGTGTTGG CGTAGCAGCT GGTTTTGTAG TTGCACTTCA AGGACGTGAC 2580 

GACAAAAGTG TCAAGAACAA CACGATCGAT CGTACTGCCC CTACTGGTTC AAAATCAGAA 2 64 0 

CTACAACGTG AATTTGAAAC GAT T AAAC AA AGTTTTAATG AC AT T T T AAA CTATGGTGTT 27 00 

CAAATTAAAA ACGAAAGTGC GGAATTTGGT AGTTCAATTG GTGGTGAAAT TAAGTATTA 27 60 

CTTGGAAACT TCAAATCTGA CATTAATCCT AATATTGAAC GTTTACAGTC ACACATCGAA 28 20 

AATTTACAAA ATCGTGGCGA GGATATTGGA AACGAAATTT CTAAGTAGCA GGTTACGTTC 28 80 

TCGATCACAA CTATTTTTAT TAGTAACAGC ATATTTATTT TTTAAAATTA AATGCCAAAT 2940 

AAAC GAG AT G PC AT TAG AAA TTAGATATTT CTTGTCATCT CTTTTTTAAA ACTCAAATGA 3000 

ACTTATGTTT ACAAATTATA GGAAGACATT GTTTGTAGTG ATTTTCGCTT AAATCATATT 30 60 

TATGAATTGA TTGAAAACAT TGCTTAGGAT TCATTGTGTT ATCCtTGCAC TTTGATTACG 3120 

CTTTACTTAA ATCATTATCG ACAAACAACA TACTTAATT TTCATTGAGC CGAACCTTAT 3180 

ATACACATTA CATATACCTT ACTTGCACAA ATTATTAATC TGGTGTTTAT TATAATTACA 32 4 0 

TATCACTATA TTTTTAGCAT TTGTATAACT TAGTTGGTCA AAAGATGCTT TTGCATATGC 3300 

CCTGCCATAT CCATTTTTTA TATTATTCAA GTTTTCAAAT ACTCATGTCA CACGCGCCTT 3360 

TTTCATAAGT GATGCTTTAT TAGCAAGAAT ATGTGTTCGC AGAAATTTGT TCTGCATTCT 34 20 

ACTTCTACGC TAGTCAATCA GACAATTTTA CCAATCCCCA CTTTCGCGTT T C AAAT C AAA 34 80 

CAATACGTCG CTCCTTTCTT CTTATATAAC AATTCTTCTA ACATGATATG TTACTATTGA 35 4 0 

ATTACTGAAC CTGAGTTAGT TATAATCTAA C T TAT AT T GA AAAG AG AT G A GGCGTAAGAT 3600 

ATGTTTTTAT G T AAAAG AC A AATTGATATC AATGCACGAT TTGGTTTGCC TAGAATTGCA 3 660 

TTTATGAGTG CAGTTGCAAC CATCATTATG TTTTTAGTTA GTTATGAAGT AATGTATTTT 3720 

TTATCTAATA CGCCATTATC AGATAGACAT TTTCTCATCT TTTK.TTACT TGTATTTATG 378 0 

ACGTATCCAT TACATAAAAG TATACATTTA TTATTTTTCT TACCATATAG AAAATCGTTT 38 4 0 

AAAGTTCATA AGTTAACTAA AAGAAAATGG CTTATATTCT ATAATACCTA CGTCAATCAA 3900 

C C T G T AC AC A AATTTTATTT TTGCATTAAC TTAATATTGC CGTTAATTAT CTTATCTGCA 39 60 

ATGTTCGTTT ATCTAACAAT TTCATTCCCG CAATATGGAC ATTATTTTAT GTTCTTATTG 4 02 0 
GCATTGAATT TCGGTATTTC CATTACAGAT TTATTATATT TAAAAATAAT TATATTTTCT 4080

AATTATGGAC AATATATAGA AGAACATAGT ACAGGTATTA ATATTTTGAA AAAAATTAAA 414 0 

AATCCATATC ATTTATAACA AAAT AST TAT AGCAAGGTGT TATTATTTGT TTTTAGGCTA 4200 

TGTAATAgcT t ACAATCAAA TGTATATAGA CCTTGTTTTT TTATTTTCAT CAATTTCTAC 4 2 60 

CCCTAAACCT AATGCTCTAG TCTGATGTCA TGGGTTATTG ATTGGTGATA ATATAAAACT 4 320 

ATGTTATATT CACGATGATT AAC T T AC AAA GGAGTTTCAA CTATGAAGAT J5TAAACAAA 4 380 

TTAATCGTTC CGGTAACAGC TAGTGCTTTA TTATTAGGCG CTTGTGGCgC TAGTGCCACA 4 44 0 

GACTCTAAAG AAAAT AC AT T AATTTCTTCT AAAGCTGGAG ACGTAACAGT TGCAGATACA 4 500 

ATGAAAAAAA TCGGTAAAGA TCAAATTGCA AATGCATCAT TTACTGAAAT GTTAAATAAA 4 560 

ATTTTAGCTG ATAAATATAA AAATAAAGTT AATGATAAGA AGATTGACGA ACAAATTGAA 4 62 0 

AAAATGCAAA AGCAATACGG CGGTAAAGAT AAATTTGAAA AGGCCCTTCA ACAGCAAGGT 4 680 

TTAACAGCCG ATAAATATAA AGAAAATTTA CGTACTGCTG CTTATCATAA AGAATTACTA 4 74 0 

TCAGATAAAA TTAAAATCTC TGATTCTGAA ATTlAAGAAG ACAGCArGAA AGCTTCACAC 4 800 

ATTTTAATTA AAGTTAAATC TAAGAAAAGC GACmAAGAAG GCTTAGATGA TAAAGAAGCG 4 8 60 

AAACAAAAAG CTGAAGAAAT TCAAAAAGAA GTTTCAAAAG AT CCAAGT AA ATTTGGTGAA 4 92 0 

ATCGCTAAAA AAGAATCAAT GGATACTGGT TCAGCTAAAA AAGATGGCGA ATTAGGTTA 4 980 

GTTCTTAAAG GACAAACTGA TAAAGATTTT GAAAAAGCAC TATTTAAGCT TAAAGATGGT 504 0 

GAAGTATCAG AGGTTGTTAA ATCAAGCTTT GGATATCATA TTATTAAAGC TGATAAACCA 510 0 

ACAGACTTTA AC AG T GAAAA ACAAAGCCTG AAAGAAAAAT TAGTCGATCA GAAAGTACAA 5160 

AAAAATCCAA AATTATTGAC TGATGCATAC AAAGATCTAT TAAAAGAATA CGATGTTGAC 5220 

TTTAAAGATC GTGATATTAA ATCAGTTGTC GAAGATAAAA TCTTAAACCC TGAAAAACTT 528 0 

AAACAAGGTG GCGCACAAGG CGGACAATCC GGCATGAGCC AATAACACAA AACCGAGCGA 534 0 

CCGTGGTTCA AAAATCATAC CACGGCCGCT CGGTTTTTTCGCATTAAAAA T CGGACAG AT 5400 

GAGCTCATGT TTCAGTATAC TCATCTGTCC GATATCTTTT AATTCTTAAT CGAGTGATTC 5 4 60 

AGGATTGTAG AATCTACGAT TTTCAAGACC AAATATTTTA TCTGTAAACT GACCCTTGTC 5520 

AGTTTTTTTA TATGCCTTTT CAAACATATT CATTCTAGCA TCGATATTAT CGATATAGCA 5580

TAAAATTTCT GCTTCTTTTA AGTATGGCAG TTTTGGAGAA CCATACTCTA ACTTACCATG 5 64 0 

AT GAG AT AAA ATCATATGTC T T AAC AAC AT GATTTCTTCT CCTTCAATGT TCAATTCACG 5700 

AGCTGCTTCA ACTACTTCAT CACTCGCAAT CGAGATGTGT CCTAATAAGT TACCTTCGAC 57 60 

TGTATACGAC GTCGCAACAG GATCACTCAA TTCTCTAACT TTACCAATAT CATGCAAAAT 582 0 

AATACCACTA TATAACAAAC TTTTGTTTAA CAATGGATAA ATGTCaCAAA TTGATTTTGC 588 0 
AATACGTAAC ATCGTTAATA CATGATAGCT TAAGCCACTC GCAAAGTTAT GaTGATGAGA 5940

AC TAGCAGCT GGATATGTGT AAAATCGTTC TTGATATTTT TTCAATAAT GACGTGTGAT 600 0 

ACGTTGTAAA TTAGCATTTT CAATATCTAG CAAATAATGA GAAATCTCTT CTTGTATTTC 60 60 

TGCCGGTGAT AAAGGTGCAC CAT C T AC AAA TTGTTCTGTT TTTAATTGAT CTTCAGTTGT 6120 

CGCTAGTCTA ATTTGGTTGA CTTTCATCTG TTTATTTCCG CGATAGTTTA TGATGTCACC 6180 

TTTAACATGT ACAATTTCTT CAGGCTTGAT TGTTGCCATA TCATTTTTTG TAGCCGTCCA 62 4 0 

AAATTTCGCT TCAATTTCAC CACTTTTATC TTGCAAATGT AATGTCATAT AATCTTTACC 6300 

TTGTGCTGTT ACACCCTGTG TAGCTTTATG C AC T AAGAAA AAGTGATCAA CTGAATCTCC 63 60 

GGGATTTAGA TTCTCTATAT TTCTCATCGT TTCCCGCCTT CCTCTATTTT GTTTAATGTA 6420 

ATCACTTCTT TTGATGGAAC AATATTATCT T T T AC AC AT G TAAAGTATAG TACTTGATAG 64 8 0 

TGTTCTGATA ATGATCGTAA ATAATTCAAC ATTTTTTCAG TACGTTTTTT ATCAAAATGA 654 0 

ACAAATGCAT CAT C AAC AAT TAATGGGAAC GGATAATATG GTCTTAGTAC CTTAATAAA 6600 

CTGATACGTA AAGCTACATA AAGTAATTCT TTTGTAGATT GACTTAGTTC AAC AGG AT C A 6660 

TATAATTGAC CATTAACATG TTTAACCGTA ATTGAATCTT CATTATAGTT AATCATCGTA 6720 

TATCTGCCAT CTGTTAAATG CTTCAATATT TCTACCGCTT CATTAATAAC TTGAGGCAAA 67 8 0 

CGTTTATCTT TAATTTGTTT AATGTGTTCA TCAACTAAAC TTTGTAAATA ACTTAAACTT 68 4 0 

GCCCAATCTT TTGCGATATC ATTAAGTTGA TTTTTAAGAC TGTGATATTC ATGTCTTAAA 6900 

TTAGCAAGCG TTGTATCAGT TTCCATGTGA TTGATTTGTG CACTTAAATC ACTGACTTGT 69 60 

GCTTGCATTT CAAGATATTG CTCATTATAT TCGTCAKTT GAGTAGCCAA TAAATGATCT 7 02 0 

TCTTCTTCAA GTTGTGCAGT TGTTTTTTCA C T T AAAC TAG AACTTAATTC ATAAGAATAG 7080 

TTTTGGTTCT CAAGATATTT AGTTAAATCA TTAAAACGAC TCAAATTACT AGTATAAGTT 7140 

TGGTAATCTT CATGATGTTG GTAAAAATCT TCTTCAGTAC CAACATTGAT AAAATCGAAT 7 200 

AGTGCTGTAA TTTCTTTATT ATTTTCTTCT AATTGAGCAT TTAAATGATT TAATTCATTT 72 60 

GTAACAAGTT TGGTATTTTC AGC AT T AAT A CGCCATTTTT CATTCGTGTC TTCAGCTGAT 7320 

TTCAACCATT GTtGCACATC GTGGAATAAA GATAATTTGT TGAAATAAAC AAATTGTGAT 7380 

TTTGTAACAG CTTCAGCATG ATTGTAGAAT GTATCTAATT CTTGAACCAA TTGCTGGCGT 7440 

TGTTGATTTA AATCACTGAT ATGTTGATCT AATGCTTTAA TATTCGCCAT TGTAGAAATA 7 500 

CTATCAACAA TTAAATCATT TGAAATTTTA GATGATAAGT ATAATTCATC CTTAACGTTC 7 5 60 

TCAACTGTCG ATTGTAATTC ATCATGACGC CCTTTCGCAT CATTAAACG ACCTTCAATA 7 62 0 

TACTGACGTT TCTCTTCTAA AATATCTTTA TTTTTCAAAG CTTGTTGCCA GTGATCACGA 7 680 
ATGCGATATT GCTCATCAAG ATCAAAATCT AAGTCATAAT TTTCATCTAA AATGGCTAGT 7740

TGTGCTTTAA TTTCTTCGAT TTCATCTGTG ATGGCCTCGC TATAATCTAC TTCTTTTGAT 7800 

TTAGACATGA TGATACCGAT AACAAATACT AAAGTTAATA CTGCGAAAAT AATACCAAAC 7 8 60 

AACATGTTGT TTGAAATAAA TGAGAAGGCA GTTAAACCAA TACCTACTAA TGTTAAAAGr 7 92 0 

ATAAACGTTG TTCGkAACAA TTTTTGACGT TTTTGttTTT CTT 7963 
(2) INFORMATION FOR SEQ ID NO: 169:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

ATATTGTCTT TACAATAGTT TGCTATGGAG GTAATTAACC AATAGGAGGA ATTTATAATG 6 0 

GCAGTAATTT CAATGAAACA AT T AC T AGAA GCGGGTGTTC mCttCGGTCA CCAAACACGT 12 0 

CGTTGGAACC CAAAAATGAA AAAATATATC TTCACTGAGA GAAATGGTAT TTATATCATC 18 0 

GACTTACAAA AAACAGTGAA AAAAGTAGAC GAGGCATACA ACTTCTCBAA ACAAGTTTCA 24 0 

GAAGaTGGTG GACAAGTCTT ATTCGTAGGA nCTAAAAAAC AAGCACAAGA AT C AG T T AAA 300 

TCTGAAGCAG AACGTGCTGG TCAATTCTAC ATTAACCAAA GATGGTTAGG TGGATTATTA 3 60 

ACTAACTATA AAACGATCTC AAAACGAATC AAACGTATTT CTGAAATTGA AAAAATGGAA 4 20 

GAAGATGGTT TATTCGAAGT ATTACCTAAA AAAGAAG TAG TAGAACTTAA AAAAGAATAC 4 80 

GACCGTTTAA TCAAATTCTT AGGCGGAATT C G T GAT AT G A AATCAATGCC TCAAGCATTA 540 

TTCGTAGTTG ACCCACGTAA AGAGCGTAAT GCAATTGCTG AAGCTCGTAA ATTAAATATT 600 

CCTATCGTAG GTATCGTTGA CACTAACTG? GATCCTGACG AAATTGACTA CGTTATCCCA 660 

GCAAACGACG ATGCTATCCG TGCGGTTAAA TTATTAACTG CTAAAATGGC AGATGCAATC 720 

TTAGAAGGTC AACAAGGCGT TTCTAATGAA GAAGTAGCTG CAGAACAAAA CATCGATTTA 780 

GAT G AAAAAG AAAAATCAGA AGAAACAGAA GCAACTGAAG AATAATCAAC TGTOAATCT 84 0 

GACTTAGATA TAGTTTAAAT GGGTGATAAG ATATTAATGC TTATCACCTT TTTTAAAAAG 900 

AAAATCGAGG CAAATTACAA ATATTCAATT AGAGTATTGG CAATCTTGCC TATAATAATG 9 60 

CTAAAATCAT AATATATAAn ATGATAACTT AT T G G AG G AA TAATGAATGG CAACTATTTC 102 0 

AGCAAAACTT GTTAAAGAAT TACGTGAAAA AACTGGCGCG GGTATGATGG ATTGTAAAAA 108 0 

AGCGCTAACT GAAACTGATG GTGACATCGA T AAAGC GAT T GACTACCTAC GTGAAAAAGG 1140 

TATTGCTAAA GCAGCTAAAA AAGCAGACCG TATTGCGGCT GAAGGTTTAG TACATGTAGA 1200 
AACTAAAGGT AACGACGCAt TATCGTTGAA ATCAACTCTG AAACAGACTT TGTTGCTCGT 1260

AACGAAGGTT TCCAAGAGTT AGTTAAAGAA ATCGCTAATC AAGTATTAGA TACAAAAGCT 1320 

GAAACTGTTG AAGCTTTAAT GGAAACAACT TTACCAAATG GTAAATCAGT TGATGAAAGA 138 0 

ATTAAAGAAG CAATTTCAAC AATCGGTGAA AAATTAAGTG TTCGTCGTTT TGCTATCAGA 14 4 0 

ACTAAAACTG ATAACGATGC TTTCGGCGCT TACTTACACA TGGGTGGACG CATTGGTGTA 1500 

TTAACAGTTG TTGAAGGTTC AACTGACGAA GAAGCAGCAA GAGACGTTGC TATGCATATC 1560 

GCTGCAATCA ACCCTAAATA TGTTTCTTCT GAACAAGTTA GCGAAGAAGA AATCAACCAC 1620 

GAAAGAGAAG TTTTAAAACA ACAAGCATTA AAT GAAGGTA AACCAGAAAA CATCGTTGAA 1680 

AAAATGGTGG AAGGACGTTT ACGTAAATAC T T AC AAGAAA TTTGTGCTGT AG AT C AAGmT 17 4 0 

TCGTTAAAAA CCCTGATGTA ACAGTTGAAG CTTTCTTAAA AACAAAAGGT GGAAAACTTG 1800 

TTGACTTCGT ACGCTATGAA GTAGGCGAAG GTATGGAAAA ACGGAAGAA AACTTTGCGG 18 60 

ATGAAGTTAA AGGACAAATG AAATAATCTG TCATAAAGTA AAACAAGGAA GAAGACACCT 192 0 

TTAATGTTGC TTTATTAAAA TGTAAATCAT TCTAATAAAA CGACAACTGT GTCTTCTTTA 198 0 

CTTGTATATG TT AC AT AT AT TCACGATAGA GAGGATAAGA AAATGGCTCA AATTTCTAAA 204 0 

TATAAACGTG TAGTTTTGAA ACTAAGTGGT GAAGCGTTAG CTGGAGAAAA AGGATTTGGC 2100 

ATAAATCCAG TAATTATTAA AAGTGTTGCT GAGCAAGTGG CTGAAGTTGC TAAAATGGAC 2160 

TGTGAAATCG CAGTAATCGT TGGTGGCGGA AACATTTGGA GAGGTAAAAC AGGTAGTGAC 222 0 

TTAGGTATGG ACCGTGGAAC TGCTGVTTAC ATGGGTATGC TTGCAACTGT AATGAATGCC 2280 

TTAGCATTAC AAGATAGTTT AGAACAATTG GATTGTGATA CACGAGTATT AACATCTATT 234 0 

GAAATGAAGC AAGTGGCTGA ACCTTATATT CGTCGTCGTG CAATTAGACA CTTAGAAAAG 24 00 

AAACGCGTAG TTATTTTTGC TGCAGGTATT GGAAACCCAT ACTTCTCTACAGATACTACA 2 4 60 

GCGGCATTAC GTGCTGCAGA AGTTGAAGCA GATGTTATTT TAATGGGCAA AAATAATGTA 252 0 

GATGGTGTAT ATTCTGCAGA TCCTAAAGTA AACAAAGATG CGGTAAAATA TGAACATTTA 2 58 0 

ACGCATATTC AAATGCTTCA AGAAGGTTTA CAAGTAATGG ATTCAACAGC ATCCTCATTC 2 64 0 

TGTATGGATA ATAACATTCC GTTAACTGTT TTCTCTATTA TGGAAGAAGG AAATATTAAA 27 00 

CGTGCTGTTA TGGGTGAAAA GATAGGTACG TTAATTACAA AATAAATTTA GAGGTGTAAA 27 60 

ATAATGAGTG AC AT TAT T AA TGAAACTAAA TCAAGAATGC AAAAATCAAT CGAAAGCTTA 2820 

TCACGTGAAT TAGCTAACAT CAGTGCAGGA AffiGCTAATT CAAATTTATT AAACGGCGTA 28 80 

ACAGTTGATT ACTATGGTGC ACCAACACCT G T AC AAC AAT TAGCAAGCAT CAATGTTCCA 294 0 

GAAGCACGTT TACTTGTTAT TTCTCCATAC GACAAAACTT CTGTAGCTGA CATCGAAAAA 3000 
GCGATAATAG CAGCTAACTT AGGTGTTAAC CCAACAAGTG ATGGTGAAGT GATACGTAT 3060

GCTGTACCTG CCTTAACAGA AGAACGTAGA AAAGAGCGCG TTAAAGATGT TAAGAAAATT 312 0 

GGTGAAGAAG CTAAAGTATC TGTTCGAAAT ATTCGTCGTG AT AT G AAT G A TCAGTTGAAA 3180 

AAAGATGAAA AAAATGGCGA CATTACTGAA GATGAGTTGA GAAGTGGCAC TGAAGATGTT 32 4 0 

CAGAAAGCAA CAGACAATTC AATAAAAGAA ATTGATCAAA TGATTGCTGA TAAAGAAAAA 3300 

GATATTATGT CAGTATAAAA CTAATATACA ATGACATATT AAAATGCCAG TATTAAACGA 33 60 

TAATGTAACA TTTAAAATGG GCATGTTTAA TTAAATCAAA GATGCATGTG ATAATTTAAA 34 2 0 

TTCACAATGA G C AT AAAAAT GGTGTTTAAA CAAGT TAAT T AAACAT AT AC TTTATAAATA 34 80 

ATAGGCATTA GGTATATTGC TATAATAAAG TTATGTAATT TTTAACCTCA GTATGTATGT 35 4 0 

CACATTTCTG GTGTAAACTG TACCGAGTCA GACTTTGGTA CAGTTTTTTT ATTTGCTTAT 3600 

TCAATGCATT AAATGAGTAT GATAAAATGA TAATGATTGT TTAGTAACTT ATACTATATG 3660

AC AGAG AT G A TCAGGCTCGG AGGAAAG AC C ATGTTTAAAA AGCTAATAAA TAAAAAGAAC 37 2 0 

ACTATAAATA ATTATAATGA AGAATTAGAC TCGTCTAATA TACCTGAACA TATCGCTATT 3780 

ATTATGGATG GTAATGGGCG ATGGGCTAAG AAGCGAAAAA TGCCTAGAAT TAAAGGTCAT 38 4 0 

TACGAAGtAT GCAAACAATA mAAAAATTA CTAGGGTAGC TAGTGATATT GGTGTTAAGT 3 90 0 

ACTTAACTTT ATACGCCTTT TCCACTGAAA ATTGGTCAAG ACCTGAAAGT GAAGTAAA 3958 
(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

ATTAAAACAA C T TAAT AT AC CTATTTATGG TGGTCCTTTA GCATTAGGTT TAATCCGTAA 60 

TAAACTTGAA GAACATCATT TATTACGTAC TGCTAAACTA AffiGAAATCA ATGAGGACAG 12 0 

TGTGATTAAA TCTAAGCACT TTACGATTTC TTTCTACTTA ACTACACATA GTATTCCTGA 180 

AACTTATGGC GTCATCGTAG AT AC AC C T G A AGGAAAAGTA GTTCATACCG GTGACTTTAA 240 

ATTTGATTTT ACACCTGTAG GCAAACCAGC AAACATTGCT AAAATGGCTC AATTAGGCGA 3 00 

AGAAGGCGTT CTATGTTTAC TTTCAGACTC AACAAATTCA CTTGTGCCTG ATTTTACTTT 3 60 

AAGCGAACGT GAAGTTGGTC AAAAC G TAG A TAAGATCTTC CGTAATTGTA AAGGTCGTAT 4 20 

TATATTTGCT ACCTTCGCTT CTAATATTTA CCGAGTTCAA CAAGCAGTTG AAGCTGCTAT 4 80 

CAAAAATAAC CGTAAAATTG TTACGTTCGG TCGTTCGATG G AAAAC AAT A TT AAAAT AGG 54 0 



TATGGAACTT GGTTATATTA AAGCACCACC TGAAACATTT ATTGAACCTA ATAAAATTAA 60 0 

TACCGTACCG AAGCATGAGT TATTGATACT ATGTACTGGT TCACAAGGTG AACCAATGGC 660 

AGCATTATCT AGAATTGCTA ATGGTACTCA TAAGCAAATT AAAATTATACCTGAAGATAC 720 

CGTTGTATTT AGTTCATCAC CTATCCCAGG TAATACAAAA AGTATTAACA GAACTATTAA 78 0 

TTCCTTGTAT AAAGCTGGTG CAGATGTTAT CCATAGCAAG ATTTCTAACA TCCATACTTC 84 0 

AGGGCATGGT TCTCAAGGTG ATCAACAATT AATGCTTCGA TTAATCAAGC CGAAATATTT 900 

CTTACCTATT CATGGTGAAT ACCGTATGTT AAAAGCACAT GGTGAGACTG GTGTTGAATG 960 

CGGCGTTGAA GAAGATAATG TCTTCATCTT TGATATTGGA GATGTCTTAG CTTTAACACA 102 0 

CGATTCAGCA CGTAAAGCTG GTCGCATTCC ATCTGGTAAT GTACTTGTTG ATGGTAGTGG 108 0 

TATCGGTGAT ATCGGTAATG TTGTAATAAG JSACCGTAAG CTATTATCTG AAGAAGGTTT 114 0 

AGTTATCGTT GTTGTTAGTA TTGaTTTTAA TACAAATAAA TTACTTTCTG GTCCAGACAT 1200 

TATTTCTCGA GGATTTGTAT ATATGAGGGA ATCAGGTCAA TTAATTTATG ATGCACAACG 12 60 

CAAAATCAAA ACTGATGTTA TTAGTAAGTT AAATCAAAAT AAAGATATTC AATGGCACA 132 0 

GATTAAATCT TCTATCATTG AAACATTACA ACCTTATTTA TTTGAAAAAA CAGCTAGAAA 138 0 

ACCAATGATT TTACCAGTCA TTATGAAGGT AAACGAACAA AAAGAATCAA ACAATAAATA 14 4 0 

ATCAAAAAGC TACTAACTTT GAAGTGAAGT TTTAATTAAA CTCACCACCC ATTGTTAGTA 150 0 

GCTTTTTCTT TATATATGAT GAGCTTGAGA CATAAATCAA TGTTCAATGC TCTACAAAGT 15 60 

TATATTGGCA GTAGTTGACT GAACGAAAAT GCGCTTGTAA CAAGCTTTTT TCAATTCTAG 1620 

TCAGGGGCCC CAACATAGAG AATTTCGAAA AGAAATTCTA CAGGCAATGC GAGTTGGGGT 1680 

GTGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTS AGACAATGTA AGTTGGGGTG 174 0 

GGACGACGAA ATAAATTTTG AG AAAAT AT C ATTTCTGTCC CACTCCCGAT TATCTCGTCG 1800 

CAATATTTTT TTCAAAGCGA TTTAAATCAT TATCATGTCC AATCATGATT AAAAT AT C AC 18 60 

CTATTTCTAA ATTAATATTT GGATTTGGTG AAATGATGAA CTCTTTGCCT CGTTTAATTG 1920 

CAATAATGTT AATTCCATAT TGTGCTCTTA TATCTAAATC AATGATAGAC TGCCCCGCCA 198 0 

TCTTTTCAGT TGCTTTCAAT TCTACAATAG AATGCTCGTC TGCCAACTCA AGATAATCAA 2 04 0 

GTACACTTGC ACTCGCAACA TTATGCGCAA TACGTCTACC CATATCACGC TCAGGGTGCA 2100 

CAACCGTATC TGCTCCAATT TTATTTAAAA TCTTTGCATG AT AAT CAT T T TGTGCTTTAG 2160 

CAGTTACTTT TTTTACACCT AACTCTTTTA AAATTAAAGT CGTCAACGTA CTTGATTGAA 2 22 0 

TATTTTCACC AATTGCCACA ATGACATGAT CAAAGTTACG GATACCTAAA CTTTTCATAA 228 0 

CTGCTTCATC TGTAGTGTCT GCAACAACCG CATGAGTAGC GATATfiCTA TATTCATTCA 234 0 
CTCTATTTTC ATCATGGTCG ATGGCCATTA CATCCATGTC TAATGCATTC AACTCACGAA 2400

CGATACTACC TCCAAAACGA CCTAGACCGA TGACTACATA TTCTTTACCC ATACTCGCCC 24 60 

TCCATTAAAT GATTTTCATC AATTCATTGA AAATATAAAT TTAAAATTAT TAT AAAT GAG 2 52 0 

TACCCCAACT AAAT TAT C T A AATGCAGTAA TGCAAGTAAA TGAAAGTTGG GGTATCGTCT 2580 

CAACTTATGA TTTCTTTCCT TCAACATATT CTTTGTCGAA AACAAATAAT CTTAATAATA 2 64 0 

ATATTAACGA TGGAAGTAAT AAAAGTAAAC CTAAAATAAA GACAATCACT AATGTCCAGC 2 7 00 

CCATTTCTGG AT T AAC AT AT GCATCTC^A TTTTTACAAA CGGATATAAA AGGTATGGCA 27 60 

ATTTACTAAT TCCATAGCCA AAGAACGCGA ACATCATTTG TAAAATAACA AATACAAAAG 2820 

CCAAACCATG TTTTTTCTTA AAGAATGTTA ACAATGAAGC TAATGCAAAG AATAAGAAAC 2880 

TTATACCAAA CATCCACCAA TAGTCAAAAA CAGCTGAATA AAAATGTTCA GAATTTGAA 2 94 0 

TGCGTAATGA TAGAAATACG AATAAACAAA TGATAATCAT CGGCGGCCCT AAAAATATGT 3000 

GCCATTGTCT TGTTAAATTA TATGCTGGTT CGTCATTTGC TTTTTTAGCA TAATATGTCA 3060 

AAAATCCTGA TGAAATATAT AAAACTGAAA TAATTGCCAA GAATACTACA GACCAAGCAA 312 0 

ATGGGCTTAA TAATAACTGC ACCCAATCTA GATCGATAAC ATTGTTTCGA ACATTAATAT 318 0 

AGCCACCTTC TGTAATAGTT AAAGCAGTAG AT AAT GAAGC TGGAATTAAT AATCCACTTA 324 0 

CCCCATACAT GATCATCCAT GCTAATTTTG TATCTTGACC AT AAT T T T C A AATGCATAAA 3300 

AACTGTTTCT CAACGATATC ATAATCAGTG CTAT'EAACC TGGTATTAAC AATACCGTGC 33 60 

CTAAATATTT GATTGACTCT GGAAAGAAAC CTACGAATCC TACGAAGAAG AAAACAAAGA 34 20 

ATACATTCGT AACTTCCCAA ACTGGGTTTA AATAACGTGA AATTAAGTGA TTAATTTTCT 34 8 0 

TTTCATCACC AGTTAACTTT G AAT GCAATG CGAAGAAACC TGCCCCAAAA TCTATAGAAG 35 4 0 

CAATAATGAT ATAGCAAAAT AAAAACAACC ATAACACTGT TATACCTATA AATGCATAAA 3 60 0 

TCATTTTTCT ATTTCTCCTC CTTGCTTCTT GGCTAAACGA TTTACATCTT CATACGCCGG 3660 

TTTATTTTTA AAC AT AC G AA TTAATACGTA TGCACATGTA TACATTAAAA TGATGTACAA 37 2 0 

TATGCCAAAT AAAATTGTAA CGAaGGTTAT TCCGCCTGCT TGTGTTGCTG CTTCTGCCAC 37 8 0 

GCGCATATAA CCACGAACAA TCCAAGGCTG TCTACCCATC TCTGTTAAGA AC CAT C C AAA 384 0 

TTCTATAGCT AGCATTGAAG CTGGGCCTGT T AAT AAT ATT C CAT AAAGCA TCCATTTATG 3900 

AG T AG AAAAC TTTCTAAGCT TTTTAAACAT TAAAGTTAAG AI&TAAACAC C T G AAAT G AC 3 9 60 

AAAACATAAA ATTCCCATCG TTACCATTAA AT C AAAG AAA TAATGGACGA T CAT AGGCGG 4 020 

ATGTAAACTT TTTGGAAAAT CATTTAACCC TTGTACTTTA GTTTTGACAC TATTATCTGC 4 080 

TAAGAAACTC AATAGTCCAG GTAATTCAAT CGCACCTTTA ACTTGCTGAG TCTTTTCATC 414 0 

T AAC AC AC C A AATAATAATA ATTTGGCATG GGAAGATGTA TCGAAATGCC AT T C ATAAGC 4 200 
TGCTAATTTT TCAGGTTGGA ATTTATGCAA AAATTTTGCA GATAAATCCC CTGCCAACAT 4260

AGAAAGTAAT GTTGAAAAGA AT C C AAC TAT CATAGACATT TTCAAAGCTT TCTTATGGTA 4 320 

G AC AG T AT C T TTAGGTTGAC GATTACGCAA TAATTTAAAA GCTGCTATTG ATGCAATAAC 4 38 0 

AAATGCCATC GTCATACCGG CTGTAGTAAT TACGTGAAAT GATCGAACTA TAAACGATGG 44 40 

GTTAAACATC GCTTCTATAG G T T G AAC AT T GACCATCTTT CCATTCTTCA ACTCAAAACC 4 500 

TGCAGGCGTA TTCATAAATG AATTCACTGA AGTAATGAAG AATGCTGAGAAAGAGCCACC 4 5 60 

AATAATTACT GGTATACTAA TTAAGAAATG TGTCCATTTA TTTTTAAAAC GATCCCAAGT 4 620 

ATATAAATAT ATACTTAAGA AAATAGCTTC AAAGAAGAAC GCAAATGTTT CCATAAATAA 4 68 0 

TGGAAGTGCA ATAACGTGTC CACCCATTTC CATAAATGTA GGCCAAATCA ATGATAATTG 474 0 

AAGTCCTATA ATTGTACCTG TAACAACTCC CACTGCTACA GTAATTGTAT AAGCTTTAGC 4 800 

CCATCTTTTG GCCATAGCTA TATATTGAAG ATCATTTTTG CGAATACCTA AAAATTCTGC 4 8 60 

AATTGCGAAC ATTAAAGGCA TACCAACACC AATCGTTGCA AAAATGATAT GAACTGCTAA 4 92 0 

AGTCATAGCT GTCAAAAACC GACTGATTTC Z&CTGTATCC ATTTAAAAAC ATCACCTTTT 4 980 

TCTTTTTTTG ATGACAACAC AATGAACTTA ATTATAATTG CTATAATGTG TATTTTTAAA 504 0 

ACATTATGCG CTATGAAATT GTGTATAATT TATGTCAGTT CACAATGTGT TCACAAATTT 5100 

GAATTTCAAT GTATAATTGT GTATATTACA T T AGAAT AAA GCACGAAGGA GCATGA5CA 5160 

TGTCAGAAAT AATCGTTTAT ACGCAGAATG ATTGTCCACC TTGTACATTT GTAAAAAATT 5220 

ATCTAAATGA GCATCACATT GATTTTGAAG AGAGAAATAT CAACAATCAA CAATATCGAA 52 8 0 

ACGAAATGAT AGATTTTGAT GCTTTTTCAA CTCCGTTTAT TTTGTTGAAT GGC 5333 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

ATACGTGACC CTTTATCCGA AAATTTCTTT TCATATTCTG TTAAAATATT ACTGCCATCG 60 

TCTTCTTGAT GTAAATTTAG ATTTATTTTT GTAAAATACA TTCCAAATTG AG AC AT AC T T 12 0 

TCTAAACTGT AGGCAAATAG TCCTCTGTTA TCAGTTTTAA AATGTAAATC TCCTTCATCA 18 0 

TTTAAGATTT GTTGATACAA CGCTAAAAAC GTffGATACG TTAAACGTCG TTTTGCATGA 24 0 

CGATTTTTTG GCCATGGATC TGAAAAGTTC AAATAAATAC GCGAAACTTC GCCGTCTTTA 30 0 
AAATATTCAT TTAATTCAAT GGCGTCATTA CAAATAATCT TTAAATTTGT TAAACCCATC 360

TCTl'T^CTT TATCCAATAC TTTATAAACG ATACTTTTCT CACGTTCCAT TGAAATAT& 4 20 

TTAATATGAG GATTTTGAGC AGCTAATGTT GTAATAAACT GCCCCATACC CGAACCAATT 4 80 

TCAATGTGTA TCGGTTGCGT TTTaTCAAAC CATTCAGTCA TTTTCCCTGc ATGTTGACCG 54 0 

TCCATGTCAA CCAATTCAGG ATGATCTTTT AAATAATCTT CAGCCCATGG TTTGTATCGA 600 

ACTCTCATAT TTTATTCTCC TCTTAAATAA ACATGTTACT ATTCATAACT TCATTTAGGA 660 

AT T T AAGCCA AGTGTTCATA TCCTTATATC TTTTTTGCTC TTCATACCAT TGAACAAGAC 720 

CTATAGATTG AATTACCGTA TACCATTTCA TACGTTTATT TAAATTCAAG CTCTCTTGAA 78 0 

CACCATATGT TTCAAGCCAT TCAGACCATT GTTGTTGTGGAACATAGTTG TAAAGCAGCA 84 0 

TTCCGATATC AATTGCCGGG TCTGCAATCA TTGCACCTTC CCAATCAACT AAAAATAGTT 900 

CATCTCGATC GGATAATAAC CAATTATTAT GAT T C AC AT C ACCATGTACA AC AG T G AAAA 9 60 

AACGCGAATC TAAACTCGGT ATATGCTCTT CTAAATAGGT TAATGATTTT CTCACAATAT 1020

GATGTGTTAA AACTTCTCTT GATAAAGAGG CATTAATTTT ATTAAGCATA ATCTCAGGAG 1080 

TAATAGGTTC CATTTCCATA CGCTTTAACA TACTTAATAA AGGTCTAGAA TTGTGTATCT 114 0 

TCTTTAATAA ATGTGCAACT CTTGTTTGCT TCATTTCGTT TGAAGATAGT TCACGCCCAT 1200 

TTTTCCAATG TTGTGCTGTA ACAACCTCGC CTGTTTCTAT GCGTTTCGTC CATACTAATT 12 60 

TGGGCACAAT ACCTTCTGCT GATAATGCCG CAATAAATGG ATTTGAATTT CGTTTTAAAA 132 0 

ACAACTTTTG TCCATCTTGT TCAGCCATAT ATGCTTCACC AGATGCACCA CCTGCTGAAT 138 0 

CAAGTGTCCA CCCTAATTGA TAAAACTGCT CCAACTCGTC CACCTCACT TCAATTAGAA 14 4 0 

AATGGCTCTA GAAATAGGTT TTTCAAGAGC CATATATTCT AATTTATAAC ACCATACTGG 1500 

TACAAATATT ATGTCCAGAT AATTATTGTA AATCCTCAAC CAATGCCTAC ATTACACGAC 15 60 

TAAATTTAAA TCGTAATGTC TGTCATTGAC ACCATACATT CTATAGTCAC TTACTTGACA 1620 

TATAATGTTA CCGTGTCTAA AACTACATGT TTTTGAATCT CTGTAGGCGA TAAACTcTAG 168 0 

TTTTCAAAAT AATTGCTATC CCATTTTCAT GGTTAGCATA AATTTATGAA CTGTAACATT 17 4 0 

TACGTACTTA GTAAAATATG ATGCACATCA TATTTGTrAC TCATAGAAAA TTTTATAAtT 1800 

TTTATCATTA TATTTCAACT GAAAATGAGA AACAAAATGG CACTTTTTAC TAATATGTGT 18 60 

TTTCTAAACA ACACTTTTAA GCTTCGTTTT AAATTATAAC ATAATTCACT TACGAAAGTT 1920 

GATAAATTTA AGTAATTTAA TCTAAAAATA TGATGAAAGA ATTTTAAATA CTGTGTGACT 198 0 

CTATATACTT TTCAAATCCT TCTTGTAGTT GACGTGTAAT TGGGCCAACT TTACSTCAT 204 0 

TAACTGGTTC ACCATCTAAT TTAATAACAG GTGTAACCTC AGCTGAAGTA CTTGAAACAA 2100 

TAACTTCATC TGCGTTTTTC AAGAAATCTA CAGTAAACGT TTCTTCTTTA AATGGGATGT 2160 
TATAGTCTTC GGCAATTTTT TTAATTACAA TTCGTGTAAT ACCATTAAGA ATATAGTTGT 2220

TAATCGGATG TGTATAAATC ACACCGTCTT TAATTGCATA AGCATTACTT GAAGATCCTT 22 8 0 

CAGTTACAGT TtCACCTCGA TGTTGAATTG CTTCAACTGC ATTATATTTC ACAGCATATT 234 0 

CTTTTGCTAA TACATTcTCC TAATAAGTTC AAGCTTTTAA TGTCGCAACG TAACCATCGG 2 4 00 

ATATCTTCAA CGGTAACACC ATTCACACCA TTTTCT^AT GATCATAAGG ACGATCATAA 24 60 

CTCTTTGTAT AAGCAACAAT TGCTGGTTCT ACTTCAGGTG TCGGGAAGCT ATGATTCCTT 2520 

TCAGCTACAC CACGCGTTGC TyGAATATAA ATTGCCCCAG TTTCAATTTG ATTCATATCA 258 0 

ACTAATTTAC GAG AT AG T T C AATTAATTCT TCTACAGAAT AATTTAAATC TAAACCAATC 2 64 0 

TCATTGGCAC TACGTwAAAw TCTTTCATAA TGTTCTGTTA CTGTAAATAA CTTACCATTA 2700 

TATACTCGAA TGTATTCATA AATACCATCG CCAAATACGT ATCCTCTGTC GTTGTATGAA 27 60 

ACCTTTGCTT CACTTGGACT TACAAACTCA CCATTTAAAA AAATTTTTTC CATATATTAT 2 820 

TCCTCCACGC AT AAT G AATA AATTGCTTCT AAGTAAATAC TAGTTGCGTT AAATAACTGT 2880 

TTTTTAGTGA TATATTCATT TTTCTGATGC ATTAAATCTT CAGAATCACT AAACATTGCG 2 94 0 

CCAAATGCTA CACCCTTGTC TAAGTTTCTC GCATAAGTAC CGCCACCTAT AGTATAAGGT 3000 

TCAGTCATAT CATTTGTTTG ATTTCTATAT GCAGTAACTA ACTTTGTAC AAAAGGATCA 30 60 

TTTTTATCAA CATAATGTGG TGGTTGGACT TTACCTAATT TCACTTCAAA GCCATATTGT 3120 

TGAATCTCAT TTGCAAAACG ATCCATAGCT TTTTCAAATT CAAATCCTTC TGGGTAGCGT 3180 

AAGTTGATAC CGAAAAGACC TGCGTTTTCA TTATCATATG TAATAACACC AATGTTAGTT 324 0 

GTCACGTCAC CCATGACATC TGTATGGAAT TTCATTCCCA TCTTTTCACC AAAATCTGAA 3300 

TTAAATAAGT AGCGATTACT AAATGCTACA AACGCTTGTG CATTATTATC AAGATTTAAT 3360 

GATGCTAAGA ATTTTAGTAA GTAAAGACCC GCATTCACAC CGATAGATGG ATCCATACCA 3420 

TGAACCGCTT TACCTTCAAC TGTTAJAACT AGAATGCCAC TATCAACAGT ACTATCACCT 34 8 0 

TGTAAATGAT TTTGTTCTAA AAAGTACTCA AAGTCTTGAA TAACATCTGT CATATTTTCT 354 0 

TTAACAAGCA CTCTTGCTTC TGCAT GATCA GGTACCATGT TGTAACGTTC ACCAGATTTA 3600 

AAAGTTATTA ATTCATAATC AGGTTCATCT TGATCTTCAG TAAGTTTATT TGAACTAAA 3 660 

TCAAATGTTG TAATGCCTTT TTCAC CATGA ATACATGGAA ATTCTGCATC TGGTGCAAAA 37 2 0 

CCTAATGTTG GCATTTCTTC TGTTTTAAAA TAGCGATCCG TACATTTCCA ATCAGATTCT 37 8 0 

TCATCCGTAC CAATAATCAT ATGAATACGT TTCTTCCAAT CCACATTCAT ATCTTCTAAT 38 4 0 

ATCTTAATTG CATAATAAGC AGCAATTGTT GGACCTTTGT CATCAAGTGT ACCTCTAGCT 3900 

ATGATAGCAT CTTCTGTTAC AACCGGCTCG AACGGATTAC TATCCCATCC ATCACCAGCA 3 9 60 
GGAACAACGT CAACATGACA TAAGATACCT AATACGTCAT TTCCTTTACC TGCCTCAATT 4020

CTTCCTGCAA TATGATCCAC ATCATGTGTT GT^ATCCAT CTCTATGTGC AATTTCATAC 4 080 

ATGTAGTCTA ATGCCTTACG AGGACCTGGA CCAACTGGTG CGTCTTCTGA TGCTTTTGCA 414 0 

TCATCTCTCA CACTTTCAAT TGCTAATAAT CCTTTTAAGT CATTAATGAT TTGATCTTCG 4200 

TATTGTTGAA CTTTTTCTTT CCACATTCGA AATCGACTTC CTTTTTTCTA TAAGTTAAA 4 2 60 

TCTATTTTAC ATGAAAAGAT ATAAAAACTA CAATAAGATG TCAGAAAATA ATAAAAAGGA 4 320 

ACAAAACGAT GCTATTGATA T G AC AC AAA T CATAAATAGC TGCTTTGTTC CTTTTTTAAT 4 38 0 

TTATATATTT AAAATACACA TATTCAAGAG CTCGAGATAT AAGTCAATGT ACTAGGCACA 4 440 

CAATTTAATA TTGACAGTAA TTAACCGAAC GAAAATGCGC CCCGGGGCCC CAACATAGAG 4 5 00 

AATTTCGAAA AGAAATTCTA CAGACAATGC AAGTTGGCGG GGCCCCAACA TAGAAGCTGG 4 5 60 

CCAATAGTTA GCTTTCAATA ATGTGCAAGT TGGGGTAAGG GCCCCAACAC AGAAGCTGGC 4 620 

CAATAGTCAG CTTTCAATAA TGTGCAAGTT GGGGTAAGGG CCCCAACACA GAGAATTTCG 4 680 

AAAAGAAATT CTACAGACAA TGCAAGTTGG CGGGGCCCCA ACACAGAAGC TGGCCAATAG 4740 

TCAGCTTTCA ATAATGTGCA AGTTGGGGTA AGGGCCCCAA CACAGAGAAT TTCGAAAAGA 4800 

TAAAGAAATA CGTTTTCTTT AGATATTAGT ATTTCTTATG AATGAGTTTC ACGCATGTAT 4860

TCTTCTTTCT ATATGCATAT TAGCTATGAC T AAC GAT AAA GAACCTGAAA C AC T AAT AAA 4 920 

TGTCCTATAG TTTACAATAT TATAT TGGCA GTAGTTGACT GAATGAAAAT ACGCTTGTAA 4 980 

CAAGCTTTTT TCAATTCTAG TCAACCTTGC CGGGGTGGGA CGACGAAATA AATTTTGCTA 504 0 

AAATATGATT TCTGTCCCAC TCCCTTATCA TTTCTGTCCT ACTCACATCT TATTCTTTAT 5100 

CAGATAATGC ATTTTTATTC TTTTTTAAAT CTTCTTCAGT GACGATACGT AAATTATTAT 5160 

TTGGTGTGCG CCACCTTCAT CATCAAATTT ACCTTTTTCA ATACTTTCGT CAGTCTTATT 5220 

GTCATATTCG GTAAATTTTG ATTTTTCTTC TTTGAAAAAT GCTTTTGBT TATTTTTTAA 5280 

TCTATTAGCA TATTCTTTCG GATTTGTTTT TACTTCTTTA ATTGTTTCAT TAGCAATTGT 534 0 

TCCTAATTGC GTCGCTTTAT CCTTAGCATT ATCTTTATAG CTTTGAGGAT CTTGTTTATA 54 00 

TTTATTATAT TCcTGcTTTC AGCTTGTCAC GACTATCTTT AC G T G T AAC A AGTACAGCTG 54 60 

CTACAGCGCC ACCTATACCT AAAATCGCTT TAAATAAATT ACCTTTTGCC ATATCAATCG 5520 

TCTCCCTTTT ATTTATAATT TAATTTGTCA AAATCATTTT CAGTTAATAA ACGATATTCT 5580 

CCTGAATCTA AATTGCTGTC C AAT T C T AAA TCAGCAATTT TGATACGTCT TAAATGTAAT 5 64 0 

ACCTCATTTT GAATGCTATG AAACATTCGT TTAACTTGAT GATATTTTCC TTCATAAATT 5700 

GTTACGTGTG ACGTTTGATT ATCAATATAA GTTAATATTG CAGGCTTAAC CTTGCCATCA 57 60 

GTCAGTGTtA CACCCTCTTT AAAAGCTTGA ATGTCGTCTT CAGTGATAGG ATTTGCTGAA 5820 
ATAACTTCAT ATTTTTTAGA AACATGTTTG TTTGGACTCA TTAATTCATG ATTA&ATCA 5880

CCATCATTCG TTATCAATAA AAGCCCTTCT GTATCTTTAT CAAGACGACC AACCGGAAAA 594 0 

ATATTTAGAT GTTGGTATTC AGGTATTAAA TCAATAACGG TTTTTGAATG ATGATCTTCA 6000 

GTTGCTGATA TATAACCTTT TGGCTTATTT AACATAATAT AGACATTTTC AATGTATTCT 60 60 

ATTAATTCTC CACGAACTGT TATCTTATCG TTTTCTGGTT CTATATGTGT TTTTGGTGAT 6120 

TTAATTACTT GTTCGTTGAC ATTTACAAGG CCTTTTTTAA GTAACTGTTT GACCTCATTA 6180 

CGTGTACCGA CGCCCATATT TGCTAAAAAT TTATCTATTC TCATCGTAAA AACCTAACTC 62 4 0 

TACGTCTTAA TTTTTCAGGA ATTTCACCTA AGAATTGTC CGCAAGACGC GTTTTAATTG 630 0 

TGATTGTACC G T AAAT TAG A ATACCTACTG TAACACCTAA AATAATAATG ATTAAGTAAC 63 60 

CAAGTTTAGT AGGTTCTAAG AATAGATTTG CAAGGAAAAA TACTAATTCT ACACCTAGCA 64 2 0 

TCATAATAAA TGAATACAAG AATATTTTTG CAAAATGAAT CCAACTATAG CTGAATTTAA 64 80 

ACTTCGCATA TTTTTTAAGA ATATAGAAAT TACATCCAAT TGCAAATAAT AATGCGATAC 654 0 

TAGTACTTAA AATTGCACCA GGTGTATGGA ATAACATAAT TAATGGATAG TTTAACGCTA 6600 

ACTTGATAAC TACAGAAGCT AAAATAACAT AAACTGTTAA TTTCTGTTTA TCTATACCTT 6660 

GTAACATTGA TGCCGTTACA CTTAATAGTG AAATTAGTAT TGCTACAGGC GCATAATAGA 67 2 0 

ATAATAAGCG AC T AC CAT C A TGGTTAGGGT CAT G AC C T AA AACAATTGGA TCGTAACCAT 67 8 0 

AGAAAACTGT GAATAATGGT TGTGCCAAGG CCATAATTCC AATACTAGCT GGAACAGTTA 68 4 0 

TAAACATTAA TACACCAATA GATGTTCTAA TTTGATGATG CATTCATGT AAGCGACCTT 6900 

CTGCAAATGT TTTTGTAATA T AAGGAAT T A AACTCACTGC AAAACCAGCA CTTAATGATG 6 9 60 

TCGGAATCAT TACAATTTTA TTAGTTGACA TATTTAGCAT AT T AAAGAAT ATATCTTGTA 7 02 0 

ACTGTGAAGG TATACCAACT AAAGATAAAG CACCGTTATG TGTAAATTGA TCTACTAAGT 7 08 0 

TAAATAATGG ATAATTCAAA CTTACAATAA CGAACGGTAT ACTATAAGCA ATAATTTCTT 714 0 

TATACATCTT GCCAT AT GAC ACATCTATAT CTGTGTAATC AGATTCGACC AT ACGAT C AA 72 00 

TATTATGCTT ACGCTTTCTC CAGTAATACC AGAGTGTGaA TATACCAATA ATCGCACCAA 7 2 60 

CTGCTGCTGC AAAAGTAGCA ATACCSTTGG CTAATAAAAT AGAGCCATCA AAGAC AT T T A 7 32 0 

GTACTAAATA ACTTCCGATT AATATGAAAA TCACGCGTGC AATTTGCTCA GTTACTTCTG 7380 

ACACTGCTGT TGGCCCCATA GATTTATAAC CTTGGAATAT CCCTCTCCAT GTCGCTAATA 74 4 0 

CAGGAATAAA GATAACAACC ATACTAATGA TTCTTATAAT CCAAGTAATA (TATCGACTG 7 500 

AC CAACCGTT T T TAT CAT G A ATGTTTCTAG CTAATGTTAA T T C AG AAAT A TAAGGTGCTA 7 5 60 

AGAAATACAG TACCAAGAAA CCTAAAACAC CGGTAATACT CATTACAATA AAACTCGATT 7 62 0 
TATAAAATTT CTGACTTACT TTATATGCCC CAATAGCATT ATATTTCGCA ACATATTTCG 7680

AAGCTGCTAA TGGTACACCT GCTGTCGCAA CTGCAATTGC AAT AT TAT AT GGTGCATAAG 77 4 0 

CGTATGTGAA CGGCGCCATA TTTTCTTGTC CACCAATTAA ATAGTTGAAT GGAATGATaA 7800

AAAGTACGCC CAATACCTTG GTAATTAATA TACTAATGGT AATTAAAAAG GTTCCACGCA 7 8 60 

CCATTTCTTT ACTTTCACTC ATTACGAATC TCCCTATCTC ATGTTTATTA AAGTTTTGTA 7 92 0 

AAC T AAAAGC TGTTTCTCTG T AAAAT CAT T TTTCATTATT ATGAATATAT CACAAAACTT 7 980 

TATTTCATTG TCGTATATTC AAT G AAT TAT CATAACAAAA T TAT C AAC AC ATTGTCATTG 804 0 

AATACTAGAT TTTGATTAGA ATATTACGAA ATTTCATATA AAC AT TAT AC TACTATTTS 8100 

GAT G AAC AT C GCATAACAGT AGAAAAATCA TTCTTATCAT ACACATACAT CTTCATTTTT 8160 

TATGAAGTTC ACATTATAAA TATATTCAAC ATAATTGTCA TCTCATAACA CAAGAGATAT 8 220 

AGCAAAGTTT AAAAAAGTAC TATAAAATAG CAATTGAATG TCCAGTAACA AATTTGGAGG 8280 

AAGCGTATAT GTATCAAACA ATTATTATCG GAGGCGGACC TAGCGGCTTA ATGGCGGCAG 834 0 

TAGCTGCAAg CGAACAAAGT AGCAGTGTGT TACT CAT TGA AAAAAAGAAA GGTCTAGGTC 8 4 00 

GTAAACTCAA AATATCTGGT GGCGGTAGAT GTAACGTAAC TAATCGATTA CCATATGCTG 8 4 60 

AAATTATTAA GAACATTCCT GGaAATGGGA AAT TTT TATA TAGTCCCTTT TCAATTTTTG 8 52 0 

ATAATGAATC CATCATAGAT TTTTTTGAGT CTAGGGGTGT TAAATTAAAA GAAGAAGATC 8580 

ACGGGCGTAT GTTTCCAGTT TCCAACAAAG CACAAGACGT GGTTGATACA TTAGTGACAA 8 64 0 

CTATCGAACG CCAACATGTA ACGATTAAAG AAGAAGAAGC TGTTAGTAGA ATCGAAGTTA 8700

ATACAGACCA AACTTTCACT GTACATACTC AAAAT AAT AG T T AT G AAAG C CATTCGCTAG 8 7 60 

TGATTGCTAC AGGTGGTACA AGTGTCCCTC AAACTGGTTC AACTGGTGAT GGTTATAAGT 8820 

TCGCACAAGA TTTAGGTCAT ACCATTACTG AGTTATTCCC GACCGAAGTT CCAATTACAT 8880 

CAGCTGAACC TTTCATCAAA TCCAATCGTC TAAAAGGTTT AAGTTTAAAA GATGTTGAAT 8 94 0 

TGTCAGTACT TAAGAAAAAT GGTAAAAAAC GCATCAGTCA TCAAATGGAT ATGTTATTTA 9000 

CTCATTTTGG TATCAGTGGT CCAGCTGCAT TAAGATGTAG TCAGTTTGTT TATAAAGAAC 90 60 

AAAAAAATCA AAAGACACAG CACATTTCTA TGGCAATCGA TGCATTTCT GAATTAAACC 912 0 

ATGAACAATT AAAACAACAC AT C AC AT CAT TATTATCGGA CACACCAGAT AAAATCATTA 918 0 

AAAAC AG TTT GCATGGTCTA ATTGAAGAGC GCTACTTACT GTTCATGCTG GAACAAGCAG 9240 

GAATCGATGA AAATACCACA TCACATCACT TATCAAATCA ACAATTGAAC GACTTAGTAA 9300 

ATATGTTTAA AGGGTTTGTA TTTAAGGTGA ACGGGACATT ACCTATAGAT AAGGCATTTG 9360 

TCACAGGTGG TGGTGTGTCA C T T AAAG AAA TTCAACCTAA AACAATGATG TCTAAATTAG 94 2 0 

TTCCGGGATT ATTTTTATGT GGTGAAGTAT TAGATATACA TGGTTATACT GGTGGTTATA 94 80 
ATATTACAAG TGCACTCGTA ACAGGACATG TCGCTGGATT ATATGCCGGA CATTACTCAC 9540

ATGCATCAAT GGAATAATAG TATAAAATTT GGTTCGATTC TCTTTAGTAG ATCAACTTTT 9600 

TCATTCAAAT AAAAATGACC TTAATATAAC TGAGTCACTA AAAAGTGTCG TTATATTAAG 9660 

GTCATTTCGT TAATTATGAT TCTTTTTCGT TTTTAGTACG TCTTCTAGCT AACAftGCCG 9720 

CACCTGTAAT CAGTGCAAAT TCTTTCAATG GTAAATCCAT TCCTTCAGAA CCTGTATTTG 97 8 0 

GAAGTTCTTT TTCAACTTTG CGCGATTCAT GTGTCTCTTC TTTTTTAATA GGCGTACAAA 98 4 0 

CTTTTGGAGC TGGCTGAATT TCTTTTGGTG ATACTTTCGT CGCTTCAGCT GGTAATTTAA 9900 

TTGCTAAAAT TTCATCAACA ATGAATTGCG TGTGTTGTTT GATGTCATTT AATGTCGCAT 9 9 60 

CTTCATCAAT CATTCTATTG CCATCTGCAA CATATTGATC AAT TAATACT TTTACTTTAG 10020 

CTAATTGTTC TGGTGTTGCG ATCGCTTTGA ATTTCGCATA TGTTTGTTGA GCAATGTTAT 10080 

CAATTCGCAG TAAGCTATTT TCTTTTTCAG TAATTACTGC TTCTATATCG CTTAATGCAA 1014 0 

CATTAAATCG TTTTAAAACT GCATCAACAT CAGCATTAGT GCGTGCTGTT TTTAATTCTT 10200 

CATCCATTTG TAATTTTAAA GCAGTTATAG CTTTTAATGC ATCAGCCTTA TTACGATTAC 102 60 

TTACTTTTCG ATAATTTTGC ACTAAAGCAG TGACGCGTGC AAG AT CAT C A TTAATCGTTT 10320 

TTTCAGCATC TGGCTTTTTA ATAGGATGTA CATCTAAATC ATGTATTGTT TGTAGATTTA 10380 

ATGATGCTGT TTTATCAACT TGTGCATTGC TAC GATCTTG ATCAATTTGT CCAATAGCAG 10440 

TGTCATAAAT ATTTTGTAAC TGTGCTAATA TACTATTTCT TTCTTCTACC GTTGCTTGAA 10500 

TATTCGCTTC AATTGCTTGT TTTTTATCGT TGAATAATGT TGTCAATTGT TCTCGAGCAG 10560 

ACGCCTTTCT GTTAATAACA GGTTCGATTT CACGAATTTC GTTTTTCTCA TCATGCAATA 10 620 

AATATGCCAC ATCTGCATTA GTCACTGCAC TAGCAATTTG TTGTTTAGCT TTAATTAACT 10 680 

CTTTTTCAAC TTGTGCTATT GCAATATTTT GTTCTTCATC TGTGCTTCG TTATTTGCTT 10740 

TAATTAAATT AATTTTATTT GTAGCGATAT TTTGAATTTG TTGTAATGCT GTTGCTTTAA 10800 

CTGTTGTCGC TGGTTTAATT TTTGAAATAA TATTTTGAGC ATTTATACTA TCTTGATTAA 10860 

CTTGGGCAGT CTTATCTGCA TGATTGATCT GATCAATAGC CTGATTAAGT GCTTGTTCTA 10920 

CTAAATGTTT AGCAGCTAGT CTTTCTTCTT CAGTTGATAA ATCGCTTTGA TCGATTAGTG 10980 

CATTTTGAGC TTCGGCTTTT ACACCAACAG ATTGACGCGC TGCTGGTTTA ACTTGAACTT 1104 0 

TAGGTAAAAT CACTTTGATG TTGTCGTTGC CATCAGTCnC AGTnCGATCC ACTTCTGCAT 11100 

TCGTTTTGTT TTGTGCAATG TCATTT 11126 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3660 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

TTGCCCCGCA CGGCGGTGTG nTTCCTAGAA ATAATGAATA TAAAGaGAAA TATATAACAA 60 

CGATTTTGAA TTATGAACCT GGTGATATCG TTACAATCAA ACGTGTGAGA GATAAGACCG 12 0 

ATTTGCTAAT ATATTTGTCT AGTAAAGATA TTTCTATTGG TAATGABTG GAAATTGTAT 180 

CGAAAGATGA AATGAATAAA GTAATTATCA TTAAACGTAA TGATAATGTA ATTATTGTCA 24 0 

GTTACGAAAA TGCAATGAAC ATGTTTGCTG AAAAATAAAA TAAAGAAGCC ATAAAGATAT 300 

CCATGATTGA ACTGATAAAG ACATATGGAT AATTGCTTTA GGCTTCTTTT TTATTAGTTA 3 60 

ATTTATCAAG TGAGTATATT TGAGTAAAAT ATTCACTGCA TAAAGATTGA AGATAATCCA 4 20 

GATTGTACTA TAAATGAAGA TAGGTACATG ACTGAGTTCT TTAAGTGCAC TACCATCCCA 4 80 

CTGTGGACTC GGACGCTGGA AAGTCAATTT AGCAATCGTC CAACTAGATT GTAGAACTTC 54 0 

GCCTAATAAT ACACCTAAAA TATATTGATA ACTCATTGTG ACAAGTAGTT GAATTTCTAC 600 

TATATTTTCA TCTTTTAATA TAAAATACAA CAT G AT AG AA ATTAAAGTTA TAACAACAAT 660 

GGGTGAGCCT TTTCtAGATG TTAAAATTAA AAAATAAATA AATATCAATA AATAGGTAAA 7 20 

TATAAAGAAA CTAGGTATCT GATAATGGCT CGACGCTAAA CCTATCAATA ACAAATAGG 78 0 

TGGCATAAAA TAACCACCAA TCGTTGTAAG CCATTGGCCT GCTAGATGTC TAGATTGTGT 840 

AATTGCGAAT CCTTGTTGTA ATGTCTGTTG TCGCTCTCGT GGACTTGTTA CAATGACTAA 900 

ATCTTTTGCA CGGCCACCAG CGAGTTTATT AAAC AG T AC A TGACCAAATT CATGTGTTAA 9 60 

AACAGGGATA TAGTTTAAAA TGACATCTAA ATAGTTCAAA ACAGGCTTAT GTCTATATTG 1020 

ATGAATAGCA ATATAACAAG CTGCAACAAT AACGATAATG TATATATTAA GTTGAATTGT 1080 

CGTATTAAAA AAGTTTGATA AAT AAT T CAT TGTTAACCTC ATATAAGATA TTAATTTAAA 114 0 

GTTTGCTTAT CACTTATTAT AAATGATATT GGCAT&ATA GCGTTAGACT TTAGACTTAC 12 00 

CTTAGTTAAA CTAATTTTAA TTTTTGAAAA GGTGAATATG TGTTAAAATA AAGCAAAATC 12 60 

ATTTCGATAT AAATAGGATG AATATAAATA CTGTTAATAT T GAT T AC AC T AACATAATAA 1320 

TGAAATAAGA TAGGAGAT T C CTGTTATGAC TGTTGAAGAA AGATCCAATA CAGCCAAAGT 138 0 

TGACATTTTA GGGGTCGATT TTGATAATAC AACAATGTTG CAAATGGTTG AAAAT AT T AA 14 4 0 

AACCTTTTTT GCAAATCAAT C AAC G AAT AA TCTTTTTATA GTAACAGCCA ACCCTGAAAT 1500 

AGTGAATTAC GCGACGACAC ATCAAGCGTA TTTAGAGTTA ATAAATCAAG CGAGCTATAT 1560 

TGTTGCTGAT GGGACAGGM3 TAGTCAAAGC TTCGCATCGT TTAAAGCAAC CTCTAGCGCA 162 0 
TCGTATACCT GGTATTGAGT TGATGGATGA ATGTTTGAAA ATTGCTCATG TAAATCATCA 1680

AAAAGTATTT TTGCTAGGGG CAACTAATGA AGTTGTAGAA GCGGCACAAT ATGCATTGCA 174 0 

ACAAAGATAT CCAAACATAT CGTTTGCACA TCATCACGGT TA21TTGATT TAGAAGATGA 18 00 

GACAGTAGTG AAcGnAnTTA AACTGTTTAA ACCTGATTAC ATATTTGTAG GTATGGGATT 18 60 

CCCTAAACAA GAAGAATGGA T TAT G AC AC A TGAAAACCAA TTTGAATCTA C AG T GAT GAT 192 0 

GGGCGTAGGT GGTTCTCTTG AAGTATTTGC TGGGGCTAAA AAGAGAGCGC CTTATATCTT 1980 

TAGAAAATTA AACATTGAAT GGATATATAG AGCAT TAAT A GATTGGAAAC GTATTGGTAG 2040 

AT T AAAGAGT ATTCCAATAT TTATGTATAA AATAGCCAAA GCaAAAAGAA AAATAAAAAA 2100 

GGCGAAATAA TCATGATGAC AAAAATAAAA CCGAGGAAAT CCTTAAATGG AGATTCTCGG 2160 

TTTTTTCGGT TTATTTAATA ACGAA5CGGG ACTCATCGAG TTTGTTTCTA AATTCTTTTT 2220 

GTTCGGCTTT GGATTTCTTT TTAAAATCGT TAAGGAAAGC TTCATATTTA GGTAATACAT 2280 

CATCAAGTTC ACCGTAATCT TTTAACTTTC CGCCTTCAAT CCAAGCAATC TTAGTACAAA 234 0 

ATTGTCTCAC TTGTCCTAAG TTATGACTAA CGAAAAAGAT GGTTTTGTTT TGCTCTTTAA 2400

ACTCGTAAAT TTTATCTAAA CATTTTTGTG CAAAAGTTTG G T C AC C T AC A GATAAAGCTT 24 60 

CGTCAATGAC TAAGATATCT GGATTAACTG TGATATTAAT TGAAAAACCA AGTTTTGCAC 2520 

GCATACCACT TGAATACTTT TTAACTGGTT GATAAATAAA CTCACCAAGT TCACTAAATT 2 580 

CAATAATCTT AGGTGTCATC GCTTTAATTT CTTTTCGCTT AAAGCCCATA CATAACATTT 2 64 0 

TAAATTCGAT ATTTTCAATC CCTGTAAGTT GTCCACTCAA GCCAGCACTA ATTGCGATAA 2700 

CGCTGACTTC ACCATTACGA TCCACTTTGC CAACAGTAGG CGACAAAGAA CCGCCAATGA 27 60 

TATTGCTCAA CGTTGATTTG CCGGAACCAT T&TGCCAAC AAGCCCTATG ACGTCACCTT 2820 

CATATGCTTT TAAACTAATG TCATCTAAAG CGAAAAATGT TTTGTTTTTA TGTTTGGGAA 28 8 0 

TGAGCGCATC TTTCATACGT TCTTTATTTG T AC G AT AAAT ACGATATTCT TTTGTTACAT 2 94 0 

TTTTAATGTT TACCGAAACG TTCATTTGTA GACCTTCCTT ATTCACATTT AT C TAG ATA 3000 

TAAT AT AC T A CTCAACAGTT GTTAAATTTT AAAACCTGTT GTAAAGTGTA TAGAAGATTT 306 0 

TGTTATTATC AGAGTGGGTG TTTTGACACA AAATGTTAAT CATCAATGAT AACAATGATA 312 0 

TTTAAAAACT AAACTTATTT CAACTTACAT GATTGTATAC TATAATGTAT TTGTAATAAA 3180 

CTAATATTTT AAAGAAC TAG ACAATAATTT TGATAGCATC CATGTATAGT GATAGTATTT 32 4 0 

ACAACAATTA TTATAATACT ATTTAGTTAA GTAGAGAAAT AGTTAAACAT TTGAAAGTGT 3300 

GGTTTAATGG AATGTCAGCA ATAGGAACAG TTTTTAAAGA ACATGTAAAG AACTTTTATT 3360 

TAATTCAAAG ACTGGCTCAG TTTCAAGTTA AAAT TAT C AA T CAT AG T AAC TATTTAGGTG 34 20 
TGGCTTGGGA ATTAATTAAC CCTGTTATGC AAATTATGGT TTACTGGATG GTTTTTGGAT 3480

TAGGAATAAG AAGTAATGCA CCAATTCATG GTGTACCTTT TGTTTATTGG TTATTGGTTG 354 0 

GTATCAGTAT GTGGTTCTTC ATCAACCAAG GTATTTTAGA AGGTACTAAA GCAATTACAC 3600

AAAAGTTTAA TCAAGTATCG AAAATGAAcT TCCCGTTATC GATAtACCGA CATATATTGT 3660 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

TATCGCGATT AATAGAAACG CTAGGAAACC AACGATTCTC CATTTAAATA CTTTTAAAAA 60 

ATTAATCACT TGTTGTGTAG AGTCTTGTCC GTTTTGGTTA TGATTGTTAG CCATGATATA 120 

CCTCCCTTAC AACACTCGTG GACCAGAAGT TTTCTGATCT CTCACATTAA CTTCTAACTT 180 

ACGTACTGGC ATTTCTGTGA AATATTCTAC ATTCTTTTTA ATATCCGAAC GAATTGCTTC 24 0 

AGTTAAAGAT TGAACTTGAA CAT TATTTGG T AC G AAAAAG TCAGTTTTAA TGTCGATATA 300 

AG AT T T AT T T TTTTTGTTAT ATAGTTTCGC AACTACATTT GTTGTCTTA C T T GAT CAT A 360 

TTTTGCAACC GTATCGAATG CCGTCTTTTC AACAGCTTTA CGAGATACGT AAAC AT G AC C 4 20 

ATCATCGAAG TCTTTGTATA ATCCAGGTTT TCGATGCGTA GGTTTGAAGA TACTAAATAC 4 80 

TAATATAAGA CCTATTAATA TCAATAGTGC AGCAAGTGAA ATAAGTAATG GTTGGAACCA 540

TTCAAATTGA AGGAAGTAGT CTTGATATTC AGTTATACGT CCATCTTGGA TATACATGAA 600 

TAACAGGAAC CCCACGaTTA CTACTATTAA TAAGCCAAGG ATAAAGTTTT TAAGTCGTTT 6 60 

CACCCCTAAC GACACCTCCT TAGTTAAAGT TAATTTAAAA ACATATTAAA TATGTACCCA 7 20 

TCAGTTTTTT TCTTAAACAT AATAAATTAA TAACTTTAAA TTTATTTTTA ATATATAAGA 780 

TGAAGTACCA TTTAGTAATA TATTCCCTAG TTTTTGTAAA TAAAACCTCA TTATTAATTA 84 0 

ATTyTCGTCA ATATGTTTTG AAGAACGATA TTCTAAAATA TCTGGGTCAC GATGTTTAAT 900 

TAAAACCTTA TTACTATTTC TCGGTTTCTC CTCACTCAAA GATTTTATA GCGACCATAT 960 

CATCGCTATA ATGACCACGG AAAATGGTAA CGCAGCAATG AT TAAT AAAT TTTGAATTGC 1020 

TTGAGTACCA CCTGTGTAAA T CAT GAT GAT TGCAAATAAT GCCATAATGA TACCCCAACT 108 0 

CACTTTGACA AATGACTTCG GATTAATATC ACCACTTGAA CTCAACATAC C T AAAAC AT A 114 0 

AGTTGCCGAA TCCGCTGATG TAACAAAGAA AATCATAATA ACAAGTAAAG TAATTAAGCT 1200 

TAATACAAAA CCTAGCGGAT AATGTTGTAG CGTCGCAAAA GTTGCTGTTT CTGTCGCAGC 12 60 
TTTAGCAATA TCGGCAATAT GATTATCTTG TAAGTAAATT GCTGACGCGC CGAATACCGC 1320

AAAGAATATA AAGCAAACTA ACGCCGGGAC AAAAAGTACA CCTAGAATAA ATTCTTTAAT 1380 
CGTACGTCCT T T T G AC AC AC GTGCAATAAA TATACCTACA AATGGTGCCC AAG AT AT CCA 1440 
CCATGCCCAG TAAAAGATTG TCCAATTTTG TAACCATTGG AATTTTTGAC CACCTGTCGG 15 00 

AATGCGTAAA CTCATACTAA AGAAATTTGC AATATAATTA CCTAGACCAT TCGTAftTGT 1560 
ATTTAAAATG TATAGCGTTG GCCCAACAAT AAAAAG AC C A ATAAGTACTA CAAAAGCAAG 162 0 

TAACATGTTG ATATTACTCA ACGTTTTGAT ACCTTTATCG ATACCTGACC ATGCTGACCA 1680 
AGTAAATAAT ATGGTTGCAA TGACAATCAA GATTACTTGC ATCGTGAAGT TACTCGGTAC 17 4 0 

AT T AAATAAA AAATGTAAAC CTTCGTTTAT TTGCAATGCA CCGAAACCTA ATGTTGCAGC 1800 
AACGCCTGTT ACTGTAGCTA TGACTGCCAA TACATCAATG GCACCACCTA AAGGTCCACG 1860

CATTGCCTTT TCACCTAATA AAGGCGTCAA TGTAGCGCTG ACTAAGCCAG GATATCCTTT 1920 

ATGAAAGCTA AAATATGCAA ACACTAGCGC GACAATACA TAGACTGCCC ATGCATGAAT 1980 
CCCCCAATGG AAAAATGAAA ACTGCATTGC ATCATTAATT GCAGATTGCG TGCCAGCTTT 204 0 

AT GAAT AGGC GTTAATTTGA AGGCATGACT GATTGGTTCT GCCGTTGTCC AGAACACAAG 2100 

TCCTATTCCC ATACCAGCAC TAAATAACAT AGCAAACCAA GACGGCAATG AGAATTCAGG 2160 
ATCTTCGCCT TCTTCACCTA ATGTAATGTT TGCGTATCTC GAAAATAAAA TATACACACA 2220 
GACAAATAAA ATAACTAAAA CGAGCAATAA ATAATACCAA GAAAAATGTA GCGCAATAAA 2280 

TGTAGTAATG TTTTGCGTGA GTTTTTCTAA CTGTTTCGGA AATATTGCTC CAAAAGCAAC 234 0 

AAATATCGTA CATATCACTA AAGATACCCA AAACACTAGA CTTACTGATT TATTTTTCAT 2400 

AAATACAAAC CCTTTCTGTG TAATGGTAAG TTCATACCCA TAACTGCAAC ATTTTAATCA 24 60 

TTTGTAATTT TATATAGACA CAATTAATAA TGCCTCATCT TTTAAAAATG ATATATAAAA 252 0 

C AC AC T C AAA TTATTTATCA TTGAGCAACA AAGTATTTTA TTGT&TTAA GTAATGCCTT 2 58 0 

TCTAGTGCAT TATTGATTTG ATACCTGCAA AGTTGCCATA TTTCCGTTTA GAATCAATAG 2 64 0 

TCGCTAGACA CAAAAAATAA GTCGCCTATA CAGTATTTTC TGCATAAGGC GACTTTACTT 27 00 

ACTAATCTAT ATATTAATTA CTAATTTTCC AATCATTGAT TGTTTTTCCA ACAATTGATG 27 60 

TGCTTGATAT AAGTTTTCAG GTGATAAACC TTCAAAAACT TGTGTCGTTG TTGGTTGGTA 2820 

ATGCCCTGAT TCTATATTTT TCGTAATATC TTCTAAATAC TCATGTTGTT TAATCATATC 2880 

AGGCGTTCGA T GAAT T GG AC GCGCAAACAT AAATTCATGT GTAAATGTTA TACTTTTTAA 2 94 0 

TTTTAATGCA TTTAAATCTT GATCTTGVTT AAAAGCTACG ATAGTCGTAA TATGCCCTAA 3000 

TGGTTTTATC AGTTCAATCA TAGTATTGTA ATACAAGTCT GTATTATAGG TGCAAAATAT 30 60 
ATAATCTACT AATGGAATTT CTTTAAATTG ACGCACTAAA TCCTCTTTAT GATTCAATAC 3120

GATATCTGCG CCCATCTTTT CACACCACTC TGTTGTTTCT TGTCGTGATG CGTTTGTAAT 3180 

GACAGT T AAA CCATACCGTT TAGCAATTTG AGTGGCTATA CTGCCTACAC CACCGGCACC 32 4 0 

ATTAATGATT AAGACAGACT TCCCTTCGTT TTCAGCAGGA TTCGTAGAAA TTTTAAATGT 3300 

ATCAAAAAAC GTTTCATATG CCGTAATACC AGTTAGCGGT AGACTAACCG CTTCATTAGC 3360 

ACTTATGTTG TGTGGTGCTT TTGCAACTAT AGCTTCTGAC ACCAATTGAT ATGTCGCATT 34 2 0 

TGATCCTTGT CTATTTGGCG ATCCAGCATA AAATACAACG TCACCCGGAC TAAATAATGT 34 80 

AACGTCTGGT CCGATAGCTT CAACAGTACC AATAGCATCA AACCCAAGTA CACGAGGTGC 354 0 

TTGAGTGACT TCCATTTGTC GTTGCTTTGT ATCACAGGA TTTACACTAA TGCTATTTAC 3600 

TTTGACTAAG ATATCATCGT TTTCTGGCGT TGGCTTTCTT TGTTCGTATA CTTTAAATAA 3 6 60 

ATTTCCTTCT TCCAATTTAA AGGGCTTCTC AAATCCTATC ATTTTCATAT CGTTTCACCT 37 2 0 

CATTTATGAA CTTATTTCTT ATT AT AC AAA ATAGAAGCCA TGTGTGCTTA TATCGCAGCA 37 8 0 

TCATGACTCC TTTTTCATTT GAATATATAA ATAATTACAG ACGACTTTCG TATTAAATTT 3840 

TAGACTTATT TCTACCATGT TGCTGAACAA ATTTACTTTA GATAAAAAAT TATTAAATTT 3900 

TGGTCAATTA AC AAAGT T AG TTTGTTAAAA CGTgATACTT TATTATTCCG TTACTTTAAT 3960 

AACTTGTTTA CCAAAGTTAT CGCCAGTaAA TAAATTTTTA AATGCATGTG GCGCATTTTC 4020

AAAACCATCT TCAATGGTTA CTTGTGACTG AATTTTACCT TCTTGAACCC ATGTTGCAAG 4 08 0 

CTGTTCACTA GCTTCTTTAA AAGCATTAGC GAATTCACTT ACCAAGAAGC CTCTCATCAT 414 0 

TACTTGCTTC TTAATAAGCG TACCTTGAAT ACGTGGTCCG ffiATCGGCTT CAGGATGATT 4 200 

ATATGACGAA ATTGCGCCAC ATACTGGTAC ACGTGCAAAA CGATTTAAAT GCTTAAATAC 4 2 60 

TTCATCGCCA ACTGTTCCAC CAACATTTTC AAAATAAACA T C AAT AC CAT CTGGTACTGC 4 32 0 

TTGTGCTAAC GCTTCTGCAA AATCCTCTTT CTTATAATCA ATACCAGCGT CAAAGCCCAG 4380

TGTCTCTGTT AAATAATTTA CTTTTTTGTC GCCACCCGCA ATACCTACTA CACGGCAACC 444 0 

TTTAATCTTA GCAATTTGAC CTACAACTGA ACCTACAGCA CCAGATGCAG CTGAAACCAC 4 500 

AACAGTATCA CCGGCTTTAG GTTGTCCAAT AT CAAGCAGA CCATGATATG CTGTTTGTCC 4 5 60 

TGGCATTCCT AAAAC AC T T A AATATAAATC AAGTGGTACA TCTGTCGTTG GAACTTTAGT 4 62 0 

AATTTGATCC GCTTGGACAT GATTAATGAT TCGCCAAGGC AACATACCTA CAACGACATC 4 680 

TCCTTTTTTA TAATCTGCGA GTGTCGAATC AATTACTTTT GCAACGACAT GGCTAACAAT 4740 

CGGTTTACCA ATTTCAAAAG GCTGTACATA CGAATCTGCC TTAGTCATfi GTCCTCTCAT 4 8 00 

ATATGGATCC ACTGAAATAT ACAGCGTTTG TACAAGTACA CCATCGCTCT CAAGTTTaGG 4 8 60 

CGTGTCAATC TCTTCaATTT TGAATGTATC CTCTTGAGGC ATGCCkTCTG GTATTTTGTT 4 92 0 
AAAAAGAATT TGTTTATTTT GCATCATTAA TCACCTTTCT TTATTTGAAA CTTTTACTTA 4980

TTTGTTACTT AAGCGTTAAG TTTGAATTGT GTCtTCGTGA TGTCTGTATG CAAATACATT 504 0 

CTTAGTTGTT ATATTTTGAC TTAAGCACTG ATTCATTCAT GTAACTTCAA CCACAT TATA 5100 

TTTGCTATAA TCATAAATTT AAAATGTTAC G AC T T AG AC A TTTTATGGAA ACTCTCAAAC 5160 

AATAGATAAT TTTTGAAAAG CTCTAATATT ACAAGCTTTT TTGCCCCAGA AAAACTAGCA 5220 

GTTGCTTTAT TTTTTCCATA AGAAGTCGAT T AAC T CAT T A GCAACATTTT CATTCTCATG 5280 

AAGCTGACTA TGTTGTGCAG GCTCACCTTC ATATTTAGAT TCTCGATAAC TTTTCGGACT 534 0 

ATTTCCCAGT AAATATTTTA ATGATTTCGA AGAACTATTA GACACTCTGC CGTCTJKATG 54 00 

TGTACCATCT TTCAAATCCC CATAAATATT TAGCACTTCG ATACCTTTAC CTTTATAAAT 54 60 

ATCTTTTAGC ACGCGTAATT GCTGATAAGG TTGATTCATT CGACTTGGTT TACCATCTTT 5520 

AT C AAC T G T A ATTTCATTGA CATCTTCATT CATAT TTAAA ACACCATTAA ATGTCCCTGC 5580 

AATATTCACT TGTTTGTTTA ACTGTGGCAG TGACTTGTCG T T AC CATAT G TCATCATATA 5 64 0 

TTGTGCAAAT GTTAAGTTCC CCATTGAGTG ACCGACAAAG TTGAATTTAT CGAAATTGTA 57 00 

TTCAGATTGT AACTTAGTCA GTACATTTTT AAACCACGCA GCATTCTTAT CCAAATAGCC 57 60 

TTGTCTGTTA TTTTCAAGTT CAATTTTCAC AATAGGATC ACTGCATCTT TTCTTAGTTT 5820 

CCCTTTAAAT GTCACTGCAC CATCCTTTGA AACGTAAGCA GTGATGATAT CTTTAGTTAC 588 0 

CCCTCTTTTT TCTGCTTGCT TCACCATAAA CTTTTCAGAA TTGGCACTAC CACCAAATCC 5 94 0 

ATGTAAGAAC AATGTTGGAA TTGGCTTTTT AACAAATTGC TGTTGTTGTA TTTTAAATGT 6000 

TTGTGCCTGT CGTTGACTAA ACACCACCAT AATAATAGAG CCTATAATAA TAGCGACCGC 6060 

TAACAATGTC GTAATAATTA CAAAAATTTT CTTCACACTT TTAACTCCCA TTCATGTCTT 6120 

TTATATAAGT ATAAAGGATG TGATTAAAAA TGTCCTTTAG TTGATTTTGA ATACATCATT 618 0 

AACTTTTAAG ATGACTTTGG AAAGTTGTCC GTTAACGTTT GTTAATTGAT TGCTTCTTTA 624 0 

GCTTTCAATG GTGTGTCACC CATTGATTAA TATATAAATA TGTATATGCA TGTTTAATTT 6300 

ATCTCTTTCT ATAAATAAAG ACCTACCAGC ACTCGACTGA TAGGCCTTTT AAT AT C TATA 6360 

ATTATTTAAT TTCTTTTGTT TCGGCTAACT CTTTGTACCA ATAAGACTT TTCTTAGGAT 6420 

AACGTTCTTG AGTCTCAAAA TCGACATAGA ATAAACCATA TCGTTTTTCA TAACCATTTG 64 8 0 

ACCAAGAGAA CACATCCATT AATGACCAAA TAAAGTAACC T T T AAC AT T T GCACCATCTA 654 0 

TAATAGCATC TGCAATAACG TTCAAATGTT G T C T T AC AT A ATCAATACGT GCAT CATCAT 6600 

GAACTGTTTT TTCAGATTCA ATAAATTCAT CTTTATATCC TAAACCATTT TCAGTGATAT 6660 

AAATCTTATG aTAGTTAGGA TAATCTTTAA CAACACGCAT GaTTTGATCA TATAAACCTT 67 2 0 






GAGGATAGAT CATCCAGTCC CAGTCTGTGC GAGGTACGTC GACATCAAAT TCACGTTGTC 67 8 0 

CGACACCTTT AAGTTGGTAT TTAGAACCGC CTTTATCACC TGTCGCATTA TGCGTGATTT 68 4 0 

CAGATTCTCC ATCGTAACCT CTCATCCAAT CACTCATGTA GTAATTGATA CCTAAGAAGT 6900 

CGTTTAAATC TTTGGCTGCA TCTAAAATGG CATAATCTTC ATCTGTAATG TTTAATTTAC 6960 

CGCCATTAAC AGATAAGATA TGTTGCACAC CTTCCATCGT TTCACGAGAA TfiTTACCTA 7 02 0 

AATATGTTGC ATCTAAGATG AATTTATTAT GGATGATATC TTCTAATTCT GCTGCACGAA 7080 

CATCTTCAGG ATTTGATGGA TCGAACGGAT ATTTTGTTGG CAATGCGTGT ACAACACCAA 714 0 

TTTCTCCTTT GTATCCGCCA TCTTTAAATA ATTTTACTGC TCTAGCATGA GCCACCATCA 7200 

TGTTATGATG TGATTGGAAT ACTTTTTCAA AATCATATTT AATACCTGGA GGGAATTTAC 7 2 60 

CTACTAAATA TTGACCATCA CCAATAGGTC CAATTTCATT GAATGTAGTC CAATATTTTA 7 320 

CTTCTGGGAA TTCTTTAAAA CAATATTCAG CATAATCTAC AAAGTAGTCA ATCGTTTTAC 7380 

GATTTAGAAA ATCGCCATCT TTGTGTAaCA CTTCTGGTGT ATCAAAATGA TGCAATGTTA 74 40 

CAAATGGTTC AACATGACGT TTATGACACT CTGCAAATAA CTTATGGTAA TACTCAACAC 7 5 00 

CTTTAGGGTT AACTTCGCCA TATCCATTTG GGAAGATACG AGACCATGCA ATTGAAATTC 7 5 60 

GGATACCATT AACACCGAAT TTTTCACTTA ATTCTAAATC CACTGGATAT CTGTTATAAA 7 62 0 

AATCACTCGC TGGTTCTGCA GTGTACCAAT AGTTTTCTTC TAAATACGTA TCCCATGCTA 7 68 0 

CGCGACCTTT ACCATCCGTA TTTGTCGCAC CTTCTGCTTG ATATGCTGCT GTTGCTCCAC 77 4 0 

CAAAAATAAA ATCTTCAGGT AATGTTTTAG TCATATGAAA AACTCCTATT CTTAATTTTC 78 00 

AAATTGTTGT TGAACGAAAT CAAGGGCTGC TTGGCCATCT CGTGTCAATT TGATATATTC 7860 

AGCACCTTGA GTCTTCGCTA ATTTAATACC TAATCTATCT GTATCTTGCT TAATATCTTC 7 92 0 

ATAGTTAGAC GCAACTTGTG GCGCTAAAAT GATTAATTGG TACTCTTTCA TAATGTCCAT 7 980 

ATGTGCGCCA TATCCGCCAG cTGCCGCTTT CACTGGCACA EATATTCTT CAGCTGCTTT 804 0 

ATTAAGTGCA TTGGCTAATA ATCCACTTGT ACCACCACCG GCACAAAGTA CTAAGACATT 8100 

TGTTTGTTCT GTGATATTTG AAGCTTTAGC TGCATCGTCT GATACACCAC TTGCCGCTAA 8160 

AATTGAATCA GCTTTTTTCG TATCAAAGTT TGCTGCAACT TTTTCTTTTA AATCTGAATT 8220

ACTTTCTTTA CGTCCTTCTT CTTCATCAAG AATTTCACTA TCATAAACTT TTAGGAATGG 8280 

GTAGTAAATA ATAATATCTA CAACAATCAA AGTAATAGCT AGTACGAATG AC C AT AAAC C 8 34 0 

AAAACCTGTA CCCATGATAA TGCCCAATGG ACCTGGTGTT GTCCAAGGTA AATTCACACT 8 4 00 

AAAACTATTC ATTCCTAACA CTTCAACGAA AAGTTTGAAA ATCCATACGT TAACAATTGG 84 60 

TGCTAATACA AATGGAATAA AGAACACAGG ATTCAATACT AGTGGTGCAC CAAATAAAAT 8 52 0 

TGGTTCGTTT ACACCAAAGA ATGTTGGTAC AACTGATGCA CGTCCAATCG CTTTGTTTCG 8580 
TTTAGATTTC GTCATCCACA TAAACATGAA CGGGACGACC AATGTTGCfi CCGTACCTCC 8640

AAATGTAACG ATAAACATTT GTGTACCTGA TGTAATAATT TTATCTGCGT GTTCTCCAGC 8 7 00 

TTGAAGCAAC TTGAAGTTCG CTTCGATATT CGCATATGTA ATGGCTGCAA TTGCTGGCTC 8 7 60 

TACAATTGAC GGACCATGAA TACCTACAAA CCAGAATAAT GCAAAGGCAC CAAAGATAAT 8 820 

TGTGACACCA ATCCATCCAT CTGCTGCTGT AAATAATGGT TCGAATAATT TTAAAATACC 8880 

TTCCGCTACA TTTGATTTAA AGCTGTTGCG AAT G AC T AAA TCTAATGCAT AAAGAATGAT 8 94 0 

GATTACCGCT GAAAATGGAA TTAAGTCCTT AAATACTTGT GAAATATTCG GCGGTACTTC 9000 

TTTCGGCATT TTAATTGTAA TATTGCGCTT CACACAGAAG TTATAAACAA TAACAGTTAC 9060 

AAATGCTGAT AAGAATGCTG TTAATAAACC TTTAGTTCCC ATAAATGCAC TTAAGAATCC 9120 

ACCATCTTTG GCTGGATCAG AAGCTAAGAA CAAGAATCCA CACATCGCTG CTAGCATTGT 9180 

AGAAATAAAG T T AAT T T GAT TTGTACTTTC TAGCTTACGG TTAAATGAAT CTGTTBAGA 92 4 0 

TTTCGCTGTC GTTCCTGCTA CTAAAAATGC TACAAGCCCC ATCGTATAGT TATATGGTTT 9300 

CATTAAAATG GCTTCCATGC CTTTATCCCA TTTAAAACCA AATATATTTG GCACATATGC 93 60 

AATTAATAGA AAGATACTTG AGAATAAGAT GACAGGCATT GCAGAAATAA ATCCATCACG 9420 

GATGGCTCTT AAATATATGT TACGTGATAA TTTCTCGAAA AATGGCTTCC CTTTTTCAAT 94 80 

TTGTGCGATC AATTTTTGCA TCATTGTCAT CACCCTCTTT TATAAAATTC TAATAAATGC 954 0 

TTCATTAAAT CTTTCAGTAA AATTGTTGTC ATTAAATGAT CTTGACCATG CATCATCGTT 9600 

ACACTGTATG CAATATCATC ACCTTGCGCT TCTTTAGCA ACAGGCTTGT TTGTGCTCTA 9660 

TGCGCTTCCG CAATGCAATT GTTTCCTTCT TCAATCAGTG CATCTGCTTT TGCAAAATCT 9720 

CCAGCTTGAG CTGCTGTTAA TGCTTCTAAA AACTTAGAAC GTGCATCCCC TGCAAATGCA 9780 

AC AAT T T C AA AACCTAATAA TTGGACTTCT TCTCTATTCA TAGCATTAAT CCCCTTTTAA 98 4 0 

ACTTATTTTC TTTGTTTCCA AGATGTCGCA GTATCTTTTA ATACTTTATT TAAGTCATCA 9900 

ATATTTTTGA AACCAGTTGT ACGTAACCAT TCACGAGCAG CATCTTCACC TTGTTCAATG 9 9 60 

TATACTTGAA CAGCACCAGA CCATGTAGCA CGGC CAC AAA GTACCCCGTT GAATTTAGCA 10020 

CCAGCTTCGT GAGCAAATTT TAAAGTTTCT TGGAATAATT CCGCAGAAAC ACCAGCACTT 1008 0 

AAGTAAATGT ATGGTAAATG AGTTGCTGCA TCTTGATCTT TAAAGTGTTG TGCCGCTTCC 1014 0 

TCTTTTGTAT AAACCACTTC ACCTTCAGCA AAGCCTTCTA CATATTTCAT GTTTACTGGT 10200 

ACTTCAACTT TCAATACATC AACGTTAAAG CGTGGTTCTG AGAASATTT CATTGCTTCG 102 60 

TTAACCTTTC TAGGCTTAAC TTTTGCGAAT TcAACAcTAC CGTTAT cAGG AATGTTGTCA 10320 

TCGTATGTTA AT AC T T C T AA AAAGAATGGA ATATCTTCTG C AAC AC AT T C TGAACCGATT 10380 
CTTTCAATGT ATGCTTTCTT TTGAATGTTA ATTTCTTCAG CATCATCAAC ATCATAGTAA 10440

AGTAAGAATT TAACAGCATT TGCGCCTTGT TCTTTTAAAC GTTTTGCAGA CCACTCTACT 10500 

AAAC AG T C AG GTAAACGACC TTTAGCGTTT ACGTCATATC CAGTTTTTTC ATAAGCAAGT 10560 

AATAATCCAC AATCTTTGTT ACGTGCATCT GAAGCTGGTA AACCATATTC AGGATCTAAT 10620 

AAAATTGAAG ATGCATATTG TGTTAATTCT TCCGCAACTA ATACTTTTAA TTGTTCAATT 10680 

TGAGCTACAG TTGGTTCTTC AGTTTGATGT TTTGCCATCA TGCGTTTTAA AGCACCACGT 107 4 0 

TGGTCAAATG CTAATGCAGA AATGATACCT TCGTTGTTAC TTAATTGTTC AATTGATGCG 10800 

ATTTTTTGAT TCGATTTTGA CATTTTTTAC ACCTCTAAAA CTTCAATTTG ATGAATAAA 108 60 

TCATCATAAT TATTTAAATT GACATAACCT GTTTGTGCTT CTTGTGCATT CAGCATGCCT 10 920 

AAAGTATTGG CTTTTTTTAG TAAATCGTGG TCGTTTTCAT GATTAAGAAT TGCTGAAGTA 10 980 

ATTCCAGCAA CTGTAGAATC ACCTGAACCA ACCGGATTTA ATACACTTAT TGTCGGAATA 11040 

TTCACTCTAT AGAATGTATG ATTGTGCTTA GCGAATGCAC CTTGTGCACC TAAAGACACA 11100 

ATAATCCACT CAATCCCTTC GAATAAGGGT TGTGACACTG CCTGTTTCAA ACTTTCTAAA 11160 

CTTTCATCAA GTGGCTGGTT AAGCAATTGA TATAGTTCAG AAATGTTTGG TTTAATGACT 11220 

GTAGGTTTGT ATGGATTTTC CAAAACTGTT TGCffiAGTtG CACCCGAGCA ATCTAATATC 11280 

ACAGGCACAC CTTTGTTTTG GCATCGTTCA ATGATTTGTG CATAATAATC TTGATTTAAT 1134 0 

CCTTTAGGTA AGCTACCTGA AATAGCAACT GCTTCAACTT TTTCTAATAA TTGTTCAAAA 114 00 

TGTTTAATAA ATCCTGCAGC CTCTTGATTA TCAATCTCCG GTCCCTGCTC TAAAATTTCT 114 60 

GTTTGTTGCC CTTCATGTAA AATTGCAATG CAGTTTCGTG TTTCACCCTT AATGTTATAA 11520 

AATGCATGCT TGATGTCGGC ATGATCTAAT TTTTTAGCAA TAAATTGACC TAATTCACCG 11580 

CCAATAAAAC CACTCGCAAG GACTGGCTCA CCTACTTGCG CAAGTACTCT TGTTACATTT 11640 

AAACCTTTAC CACCAGCTGT TTTACTTACT TCTTGAACAC GATTAACATC ATCTAATTTC 11700 

AATGCTGTTA ATGGGTATGA AATATCAACG GATGGATTTA ATGTTAAAGT TAAAATCATA 117 60 

TGTGTCGTCC CTTAATCGTG GTATTCGCCT CTGTCCCATT TTTCTAAGAA T T CAT C AAAG 11820 

AAATGTGGAT CAGCTTGATC TGCATTGCTT GTTTCTAAAT (ETTAATTTT AGCGATTAAT 11880 

TTTTTGTTCT CTTCAGTTGG TTTATATTCA GCATTAATAA ATGCATCGAT AATATCGCAC 11940 

ATTAATAACT CACCTATAAT ACGTCCACCG AAGCCAATAA CGTTCGCATT TAATTCTTCT 12000 

TTAGCGTATA ACGCTGATGT CATATCACGT ACTAGTGCTG AACGAACGCC AGGTACTTTA 12060

TTTACAGCAT TGTTAATACC AACACCTGTT CCACAAATAC AAAC AC C T AA GTCTGCATTA 12120 

CCGCTAACAA CTTGTTCGCC AACTTTTTTA CC AAAAATTG GATAATGTGT TCTTGTGAAA 12180 

TCGTATGTTC CTACGTCAAT GACTTCATGT CCTTTTGATT TTAAAAATTC AGATACACGC 1224 0 
ATTTTTGTAT CTGTAACAAT ATG3TCGCAT CCTAATGCAA TCTTCATAGT AATTTTTCCT 12300

CCTTAGCACA TTTTATTAAG CATATCTACG CGGATTTGGT GTCTACCACC ATCGTATTTA 12360 

CCTTCAACAA AACCTTTAAC GACATTTTTC GCTAATGTGT CTCCAACAAT TTCAGATCCC 12420 

AT AG T GAT C A TTCTTGAATT GTTATGGCCT CTAGTCATAT ATCCAGAGG TTCATCTGAT 12480 

ACTTCAGCAG CAAT CATGCC TTTGATTTTT GTAGCAACCA TAAAGCTACC TGCACCAAAT 1254 0 

GCATCGATAA CAATACCTAA GTTACCTTCT TGACTTTGAA CATCTTTTGC TACAGCCAAA 12 600 

GTTGCATCAA CAAAGTCAAC TTCCTGTCCT TCTGTTACGT CAACAACATC ATATTTATTG 12 660 

TCTAATAAGT ATGATTTGAT GACTTCTTTT AATCGTTTGC CAGCTTCATC TGAACCAATA 12720 

ATAATCGCCA TAATAAGACT CCTTTTTACT TTAATTTTGA AATACCTTTC TTAAAATGTG 12780 

ACATATTTAT TTGTAGGTTA TGAAAATCTT GAGAAAAGGC TTTCAATTTG ATTACGTTTA 12840 

AATTATAAAC ATAAACAAAC AATAAATCAA CATAATATGT TTATAATATG TTTGTTTATG 12900 

ACGTATTTTC AAACAATAAG TGAACATTCA TATTGTGGTG TTGTTTTAAT TAGGTATTCG 12 960 

TCTGAAATTG TAGTAAAACT TTGTCGAGGT TCCCGTTGaC ATAAATTTGC ATAAAAAAtA 13020 

GCCCATAAAT GAATGCAAAT TCACATTCAC TTATGAGCAT ATAGATACAT ATTTTftCAA 13080 

TGCAGTTATA CTTTTAATTT AGTCGACTAC TTCAATATAT GTTTTAATCG TTTCTACTTT 1314 0 

TTCTTCATCT TCATAGTCCA TGACCACTGC AGTCAATTCG TTTAACTGAC AAAATGATGT 13200 

AAAATCTTCT TTGCCAACTT TCGTATGATC GATTAACAAG TATTTTTCAA TTGAATTACT 132 60 

TAGTGCCAGT TGTTGCGTAT AGGCTTCATC TAATGTAGAT GTCATCACAG CACCTTTATT 1332 0 

TACTGCGTTA CTACTAAAGA ACATCTTGCT AAATCTTAGT TTTTCCAACA TGGCGTTCGC 13380 

cATTTCACCT ACAAATGCTT CTGTAATATG GCGCATTTCA CCACCAATTA AATAGACACG 13440 

AAAATGTGCT GTTTGTTTTT CTAACAAAAT TTTATACSC GGCAAACAAT TCGTAATAAT 13500 

TGTGAGCGTA TGATGATTGA CTTCTTCTGC TAATAGTTCC ACTGTTGTTC CTGGTCCGAA 13560 

AAACAAAGTA TCCCCATCTT CAATTAATGA TGCAGCTTTT TTAGCTATAA ATCGTTTTTC 13 620 

TGCAATTTGA CGGGTATGTT TTTCTTTATG CGATATTTCT TTATACTGAA ATGTTGAATT 13680 

ACTGCGTGCA CCACCATGAA TCTTCGTTAA AATCCCTTTA TTTTCCAATT CAATTAAATC 13740 

TCTTCGAACT GTCATATCAG AC AC AT T T AA ACCTTCGACG ATTTCATTCG TTCTTATCGT 13800 

GCCCTTTTTA TTCACTAGTT TAGCAATTTC GTCCAAACGT TCATGTTTAT TCAATGTAAA 138 60 

ATTGCCTC 13868 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4549 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

TTAAGTCAAC TTTGTCTATA CGGTTTGGAT CtTCTaCCCA ATGTCTTATA AAAGACAATC 60 

CCGCACCTGA AACATAACTC ATGAAATAAG AAAATGGTAT ACCATTAATT TGATCATTTT 12 0 

CATTTTTATA ACGTTGTAAA TGACACATAA TTAAATCATA AAGTTTTTCG TGGATATTTG 180

AATCTTTACC CATACGAAAC ATCAATTGAT AAAATGCGAT GTCTTTTTCT ATCATTTCTA 24 0 

TTAAAACGGT CATAATTTGA TGTATGTTAT CCGTGGATAA CTTAACTGCT CCATTTAACT 300 

TCTCATCATG AATGAAGTCT CTTATTTCCT CCAACTGCTG GTCCTCTAAT TTTTCAAGCA 360

AATCATACTT ATCATAATAA TGCGTATAAA ATGTACTACG GTTAACATCA GCTAAATCTG 4 20 

CAATTTGTTG CACAGTAATC TCTTCTAATT GGTGTTGATG TAAAAGTTCA ATAAATGCAT 4 80 

TTCTCATTGC AACTTGTGAT TTTCTAATAC GTCGATCTAT AGTCATTTAT ATCAAGTCCT 54 0 

CCCCAATGAT TATAAACGTT ATGTTCATTA TCCCACAAAT CTCCAACATT GATGATTGGC 600 

ACACAATGTT TACCTGTTTA ATATAGGTGA TACAAACAAA CAGAAAAAGG TGATAACAAT 6 60 

GAACCAACAT TTACTAGGAA ATCCAAAATT AACTGTAACT CATGTCAATG AAGTTAAAGC 720 

CGGTATTAAC CACATCGTTG TCGACAGTGT TCAATATGGA AATCAAGAA TGATTATGGA 78 0 

AAAAGATGTC ACTGTGGAAA TGCGCGATGG CGAAAAATTA TATATTAATA TTTTCAGACC 84 0 

AAATAAAGAT GGCAAATTCC CTGTAGTTAT GTCTGCAGAT ACTTACGGTA AAGATAATAA 900 

GCCTAAAATC ACAAATATGG GTGCCCTTTG GCCAACATTA GGTACCATTC CGACATCTAG 960 

TTTTACACCT GAAGAATCAC CAGACCCAGG ATTTTGGGTG CCAAATGATT ATGTTGTAGT 1020 

TAAAGTTGCA TTACGCGGTA GTGACAAATC CAAAGGCGTC TTATCTCCAT GGTCAAAAAG 1080 

AGAAGCGGAA GATTATTACG ArTGATTGAA TGGGCAGCAA ATCAGTCATG GAGTAATGGA 114 0 

AATATCGGGA CAAATGGTGT TTCTTATCTT GCGGTGACTC AATGGTGGGT CGCATCATTA 1200 

AATCCACCAC ATTTAAAAgC AAtGATTCCT TGGGAAGGCT TAAATGATAT GTATAgAGAA 12 60 

GTAGCCTTTC ACGGAGGTAT mCCAGATACT GGCTTTTATC GTTTCTGGAC TCAAGGTATT 132 0 

TTTGCGAGAT GGACAGATAA TCCAAATATC GAAGATTTGA TTCAAGCACA ACAA5ACAT 138 0 

CCTCTGTTCG ATGATTTTTG GAAACAGCGT CAAGTGCCAT TATCACAAAT TAAAACACCT 14 4 0 

CTACTAACAT GTGCTAGTTG G T C T AC AC AA GGTTTGCACA ACCGTGGCTC TTTTGAAGGA 150 0 

TTTAAACAAG CTGCATCTGA AGAAAAATGG CTATATGTGC ATGGACGTAA AGAGTGGGAA 1560

AGTTACTACG CTAGAGAAAA TCTCGAACGC CAAAAATCAT TCTTTGATTT TTACCTTAAA 162 0 
GAAGAAAATA ACGATTGGAA AGATACGCCT CATGTCATTT ATGAAGTTAG AGATCAATTT 1680

TATAAAGGCG AATTCAAATC AGCGTCACGT GTCCCTTTAC CTAACGCAGA ATATACACCA 174 0 

TTGTATTTGA ATGCTGAAAA TCACACATTG AATCATGAA AGATTAGTAG CGCGCATGTC 18 00 

GCACAATATG ACTCTGAAGA TAAACAACAA GATGTAAGTT TTAAATATAC GTTTGACAAA 18 60 

GATACTGAGT TAGTTGGAAA CATGAACTTA AAACTATGGG TAAGCACTAA AGACTCAGAT 1920 

GATATGGATT TATTTGCAGG TAT T AAAAAG TTAGATCGTC GTGGTAATGA AGTTAACTTC 1980 

CCTGATTTTA ATCATATTGA AAATGGTCAA GTAGCTACTG GTTGGTTACG CGTATCACAT 2 04 0 

CGTGAATTAG AT C AAGAAAA ATCCTCAATC GCGCAACCTT GGCATAAACA TGAAACAGAA 2100 

TTAAAGTTGT C AC AAGAT G A GATTGTACCT GTTGAAATCG AATTGTTACC TTCAGGCACG 2160 

CTATTTAAAC AAGGCGAAPC ATTGGAAGTT GTTGTAAAGG GTAGTGAAAT TGTAATTGGT 2 220 

AATAGTACTC CTGGCATGAA AACACGTTAT GAACATGAAG AAACCGTAAA TAAAGGCATG 22 8 0 

CACATGATTT ATACTGGTGG TAAATATGAT TCACAATTAA TCATTCCTAT CGTTAATTGA 234 0 

TATGCAGCAA TTACGGTCGC TTTTGATTAA AAGT GACAT A GTGKAGGAC TGTATAACAA 24 00 

GAGAAAGCCA CACGCTTGGA ATCTTAAACC AAGGTGTGGC CCTTTTTATT ATTGATGGCT 2 4 60 

ATTTAATTTT ATAACACTAT CGTATTTTCT TTTTCAT GAA TCATTTCAAT AAT GACAT T A 2 52 0 

TCTTCATTCA TTACTGCTAC TTTAGGTGCA TGGTTTTTAA TTTCTTCTTC ATTCAACTGT 2580 

GCATAAGTCA TGATTATGAC TACATCGCCT ACTTCAACAA GTCTTGACGC TGCACCGTTT 2 64 0 

AAACAAATTT TACCACTACC TCTTTCACCA GCTATTACGT ATGTTTCAAA ACGTGCACCA 27 00 

TTATTATTAT TCACGATGGC TACTTTTTCA TTTGGCAAGA TGTCTACCGC TTCCAATATA 2 7 60 

TCTGAATCAA TCGTAATGCT ACCTAGVTAA TTTAAATTTG ACTCAGTCAC TCTTGCTCTA 2820 

TGAATTTTAG CATTCATCAT TGTTCTTATC ACTTTATTCA GCTCCAATTA TTATATTATC 2 8 80 

TATTAAACGC GCTTTTGAAA ATTTAACAGC TAAcGAGATA AATATGCGTC CAGTTATTTC 2 94 0 

GTGTTGTTCT ACTAATTGAG GATAACTATA AACAGCAACT TCTTCAATGC ETCACTTAT 300 0 

ATGTGATTCA AGATATTCAG TAACCCTGTC TATAATTACT TTACTTTGAC GTTCACCGTC 3060 

TTGATACAAC GCTTGTGCTA ATAGCAAACT TTTACTTAAA TGTACCGCTT CTTGTCGTTC 3120 

TTGCTCCGTT AAATAAACAT TTCTTGAACT TTTCGCCAAA CCATCTGCTT CTCGAACGAT 3180 

ATCAATACCA ATAATTTCAA CGGCATGATT GAAGTCTTTT ACCmTTTGCT CGaCAATAGC 324 0 

CAATTGCTGG GCATCTTTTT TACCAAAATA AGCATAATCC GGCATAACAA TATTAAATAG 3300 

CTTATTAACT ACTGTTACCA CCCCATCAAA ATGCCCTGGr CCGtTCGCTC CTTCTAACAC 3360 

ATCAGCTAAT GGGCCTACTT T GACAT C AAT ACCTAATTCA CCTGGATACA TATCTTCTAC 34 2 0 
TGCAGGATGA AAAACAATGT CCGCTCCTAC TTCTGATACT AATTCTAAAT CTTTATCAAT 3480

TTGTCTCGGA TAAGCATCGA AATCTTCGTT TGGACCAAAT TGTAATGGAT TAACAAATAC 354 0 

ACTCACAATT GTAATATCAT TTGTACTAAC TGATTCGCGT ACCATCGTTA AATGTCCACT 3 600 

ATGTAAGGCA CCCATTGTTG GGATAAAACC AATCGTTGTG CCTGAGCGTT TGGCTGCTTT 3660 

AACAATGTGT TGCATCTCTT TTACCGTAGT AATCAGCTTA GTCATTGTTA TTAACCTCAT 37 2 0 

TCATAATTTT CTTTTTATAC GTATATTCTT CTGATGGAAA TGCACCAGAT TTAACTTCTT 37 8 0 

GATCGTATTG TTTTAAACCA TCCACACCAA CACTAAAATC AGCAAATTGC TTCACAAATT 38 4 0 

TCGCTTTATG TTCAACACCA TAATTTAACA TATCGTGATA AACCAATACT TGACCATCTG 3900 

TACCTTTTCC TGCACCAATA CCAATGACTG GAATTGTTAA GTGCTTGCTA ATTTCTTCTG 3960 

CTAAATCATT TGGAATTGCT TCAAGTACTA ACGCAACTGC ACCAGCTTGT TCTACATTTT 4 020 

TCGCGTCTAA AATAAGTTGc TCCGCTGCTT CTTTCGTTGC ACCTTGTAAT TTATACCCCA 4 080 

TAACGCCAAC ACTTTGAGGT GTTAATCCTA AATGTGCAAC AACAGGAATA CCAATTGCCG 4140 

TTGCTTTTTC AATAAATGGT GTAATATGCG CTCCTTCTGC TTTAATTGCA TTTGCATTCG 4200

TCTCCTGATA AAGCTTTAGA GCATGATTTA AGTCTTGTGT CATAGAGATG CCTACTGCAC 4 2 60 

CAATCGGCAT ATCAACAACT ACAAATGTAT TTGGTGCGCC TCTTCTTACT GCACGACCGT 4 320 

GATGAATCAT ATCTGCTAAC GTCACTTGTA CGGTACTTTC ATAACCTAAT ACAGTCATAC 4 380 

CAAGTGAATC CCCAACAAGA ATCATATCAA TACCCGCTGC TTCCACTTGT TTAGCACTTG 4 440 

GAAAATCATA AGCTGTTACC AT AG AAAT T T TAGTTTGCTT TTGTTTCATA TCTATTAATT 4500 

GACTTACTGT TTTCAATGTT ATTCAACCTC TTTTTGCAGT AT n AT TAG A 4 54 9 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

TTATCTTTTG TTGTTTCCTT AGACAAACGA C T AAC C AC AT TA&ATGGAC TAATTTATTA 60 

ATTTTATTTA ATTCCATTAA GTTATCCGTA ACACTAAGTG AAGATGCGGA GTTCACTCTC 120 

GTTTGTACTC TTCGTTTTAA TAAAGCACCT CGTAATAATA CAATCATTCT TCTTATTAAT 180 

GATGCTTGTC TATATACCTG TGTTCTTTCA GCATAACGCA TATAGTTTTC AAG T AC AC T A 240 

TTCGTTATTT GTCCTTCATC TACTAAAGAC TCTAATGTTT TkGTTTCTAC AT T AAAAGC A 300 

ATTTTTTGTA GACGTTCTAA TTCTTTAGAG TTTTCATCAT CTTTCTCTAC AGTTTTTAAA 3 60 
AATGCTAATT TATCATGATA TTCTTTAATC ACGTTACCAT ATTTAAAACT TGTTTCGAAA 420

GTAGATTTTT GAT T TAG AT A ATCAATAACT TGTTCTAATA TATAAATTCT AGCAACTTTA 4 80 

AACGACATAT TGCCAATTAC TGTTTTAGGT GCAGGTTTCG TTAATAATGG CAATAATACT 54 0 

TGCGCAACTA CCAAACTAAT AAT AAC CAT A CCAGATGCAA TAAATAATAA GTCGTTTCTA 60 0 

TACGTAAAAG CATGATGCCC TGCTAAAAAA TACGGTAACG TTAATGCGAT TGCTAAAGAA 660

ATTGTTCCAT GCACACCACA TAACGTCATA ATTAAAGCGT ATAAACTTCG CTTTGGTGGT 72 0 

TTCTCAGTCG TTGGATTATC ATCATCATTT TTAGTCATCA TTTTTTGGAA TGGACTGATG 78 0 

GCTAAATAAA AATAAGGATA TAAGACATAA ACCCAAACAA ATCTAAATAG ATAGACAGCT 84 0 

AAAGCAACAA C AAT AG T GAT GCCTATTAAA AAGATTAAAT TGTGCGGTTC TGTTTTGATA 90 0 

ATTTTAATAA TAACTTCAGG TACTAAAAAT CCTAATATTG AAAAAACAAA GCCATTTAAA 9 60 

AC AT AACC T A GTATATTCCA TGTATGATTG TAACTCATTT GCAGTTGTGT ACGTACTTGC 1020 

ATAATTCTGT CACGTTCGAA ACCATGTACA AffCCTGCAA CTACTGCTGC AATGATTCCT 1080 

GATGCGTGaA ACAATTCAGC AATTAAATAC GTAACAAATG GTGTTAACAA TTGAATAATT 114 0 

GTAAACATAT TAATGTTTTC ATATCCTCGA CGCATCAATG TTAATCGGAA CCTTACTAAT 12 00 

GCCATACCTA TAAGTAAACC AACCACTGCG CCACCAATTG ATGCAATTAA AAACAA03A 12 60 

ACAGCATCAA CAAGTGAAAA AGCACCTGTA ACTAATACTC CAACAGCTAT TTTAAATGAA 132 0 

ATAATACCAG CAGCATCATT CAATAATGAC TCACCTTCAA GAATTGTCAT TGCTCCTTTT 138 0 

GGCAAGACCT TTCCTTTAGT GATTGCTTGC ACTGCTACTG CATCAGTAGG ACAAAGAATG 14 4 0 

GCAGCAATTG CAAATGCTGC TCCAATAGGT AAATCTGGCC AAATCCAATG AATAAATAAA 1500 

CCTACACCTA TCACAGTAGT AAT G AC T AAT CCTAATGCCA T CAT CAT C AC TGGCTTAATA 1560 

TATTTCCTTA AATGGACTCT AG AAAC AT T A ACACCTTCTA CAAATAACAA AGGCGCAATC 162 0 

ATTGTTACCA T AAAC AAT TC AGAATCAAAA T T AAAT T G AA CAGGGATTGG GGTAATAAAT 1680 

AGTAACATGC CCAAGAAAAT TTGTATAAAT GCTAGGGGTA CTTTAGGTAT GAAAGTATGG 174 0 

ACAAACGAAC T TAG TAT C AC AACAGCTATA AATATAAGAA TTGTTTCAAA TATTTCCAAA 1800 

CTTTCACCTC TCTAAAAAGT ATTGTTTAAT TGAAAATTAA GTATCACATC TCGTTGTAAT 1860

TATACTTTAG AGGATAAATT GAG T TAG C G A CCACAAAAGC ACT TT AAT AT AGATATATGT 192 0 

CTACGATTGC AG T AC T T AAA TTTGCAATTA TTTAATTTTA TTTTATCACT AATTGTTTGT 198 0 

ATAAATAAAC AACTTGCTTT CACATAACAA CATTAACTTA TAATACAAAA AATGAGCACC 204 0 

TTAAAATCGA C T AAC C AAT T TCaAAGTACT CTTTTAATGA TTAATTTTGA AAAC AGAT T T 2100 

TCaAAAGCAT TGTTATGCTT AACAATTTAG CCAACACTTC AATCGTTTTG ATACCATTTC 2160 
TTACGATGCT CTTCTCGTTT TTCAGCACGT AATTGTAATG CTTCTGTAGA GTTTTGTTCA 2220

TTTGAACTTA ATAATATTGA TGCATGTGTG TGAGCATCAT TTTTTCfiTA CATATAAGCG 2280 

CCGTTGCGAT AAGCAGCGCG AGCGACTAAG TGCATGCCGA CTGGTGAAGT TAAATTAATA 2 34 0 

AAAACAAGTG ACAGTAATAA ACGCACACTG AAAAATCCTG TATTCACAAT AAAATAAATT 24 00 

AATACACCGA TTAAAGTTAA TAACACGGAT AAAGTTGAAC TTTTTGTCGC AGCGTGACTT 2 4 60 

CTTAAGAAAA CATCTTGGAA TTTCACGATA CCTATTGCAC TAATAAGAGC AAT AAAAC T A 2 52 0 

CCTAACAACA ACATCACAGC AGCAATAAGA CTAAAGATTT CTTTTGTTAT TTCCATTAAA 2 58 0 

CACATGCCCC CCACCAATAA AGCGTGATAT TGAAACAGAA CTTACAAAAG ATATAATGGC 2 64 0 

AATGAGCATG ATTGAATCTA AGAAAGAASC GGTGCCCATA AGTACACTTA ACACACCCAC 27 00 

AAT T G AC AT T ACGACAGCAC TTGTTGTATC AAATGTAACG AC AC GAT C T G CTGTTGTAGG 27 60 

TCCCTTGATT AATCTAAATA AACAGATGAT TAATGCAATT CC AAAAATAA TGAGTGAACT 2820 

AATAATCATA ATATGTGTTA TTGTTTGTAT CATCGCGACA CCTCCAATAT TAAGCTTCA 288 0 

TAATGCTTAA TACTTCTTAA CAAACTATCT TTTTCTTTTT CTGACACGTC G AT AC TAT G A 2 94 0 

ATAAAAAACT TTTTAGAGTC TTGAGAAATT CGTATTACTG TAGACCCTGG AGT T AT AAT A 3000 

ATTAAAATTG TTAAAAATGT T AT T G AC C AA TCACTTGTTA GTCTTGTTTC ATATGAAAGT 30 60 

AATCCAGGGT TCATATCTTT TGTTTTAAAA AGAATATAAT TAATCGTGCT AATGCTAGAT 3120 

GTTATTAATT GATATAAATA AAC AC C T AAA AAT T T AAT AG CTACCCATAT TTTTCTAACA 3180 

T AAAAAT CAT CGCTGAAAAA CCTGTGTAAT AT AT AAAT GA CAAT TAAACC AAT TAG AT AT 32 4 0 

C C AG AAAAG A AAGTCGAGAA TTTAAAATGA TCTTCffCTT G AAAT AAT AC CCATAAGAAT 3300 

GCAATGATAA TATTTAAAAC TATTTGATTC ATTTAGTCCT CTCCTTTCAA ATGCGGATTT 3360 

ACAAGTTTTT GAT AT AAT T G ATCACTCGTG T T C AAC T C AG TTGCATCACT TGTAACATTT 3420 

AACACAACAG GTGCAGCAAT TCCGATTGCG ATAACCACAA CTACTAAAAT ACTTAAAATT 34 8 0 

CTTTTTCGAT ATAGCGGGAT TTTCTTAAAA TTAACTTCCT CCCCATCTTT ATCTCCAAAA 354 0 

TACATATAAA AAAGTATCCT AAATAAACTG TACATTGCAA TTAGACTAGT AATAATCATT 3600 

AACGCTAGTC CAATATAATT GCCATTTTGC AATGCACCTT GGAAAATAAG TACTTTCCCC 3660 

GGAAAGCCAC TAAATGGAGG CACGCCGCCA ATAGCAAAAA T CAT TAT AAT AAACGCAACT 37 2 0 

CCAAATAAAG GTTCTTTTTT AGCTAAGCCA TTCAAATATT GATATTGTCG ATAGCCTGTA 3780 

ATGTAAACTA AAC T AC CAAT AATAAAAAAT AGCAATGTTT T T AC AAC AAT GTCATTTACC 38 4 0 

AAATAAAATA TTGCACCATT AATACCTGCA AACGTGTTTG TTCTAAACC T AAAAT GAT A 3 900 

AATCCTATTG AGATTATGAC TTGGTAAGCT GCAATCTTTT TAATATCTTT ATAAGCAATG 3960 

ACACCTATAG CGCCGATGAC CATAGTTATA GCAGCCATAG TTGCTAGCAA TGGATGTATG 4 020 
AGATCATTAT GTTGATCAAA TAGTAAAGTG AAGAATCGAA TTAATGCATA GGCCCCTACT 4080

TTGGTCATTA ACGCTGCAAA TAATGCTGCA AGCTcAGTAT TTAACACAGC GTAGGCTTTG 4140 

GGTAGCCACA TAAAAAGGAC CAGCGCTGCT TTCGCACTAA ATGCGACTAA GAAGATTAAT 4 2 00 

GAAATCATTG TAACAGTGCG ATTATCTCCC ATGTCATTCA AACGCATTGC AATATGTGAA 4 2 60 

AAGTTTAATG TACCTACTGT TTTATAAAGT AAACC T AT AC CTAATAAGAA TAGCCATGAA 4 320 

CCAATAATAT TCAAGACAAC ATAAATAATT GCAGCACGTA ATTGTTCTAC AGATTGTCCA 4 38 0 

AGTGTAATGA GTACAAATGA CGCTAGTAAC ATAATTTCAA AC AT G AC G T A TAAATTAAAT 44 40 

AAATCTGATG T T AG AAAAG A GCCTATCACG CCAACACTTA AAAAT AAT ATGAAC GATGGC 4 500 

AAGTGATAAC GATTTGCTTT ATGTTCGCCA CGCCCAAATC CGTATGCCAT AATTAAAGTA 4 5 60 

ATCACAAACG AAGCGGTTGT AACCATAATT AAACTTAAAG AATCTCCTAA AAACTGTATA 4 62 0 

CCAAAGGGCG CTGACCATCC TCCAAAGTCT AGCGTAATTG GACGGTGACG CTGAACATAA 4 68 0 

ATTAATAGCA TTAATGAAAT AATTGTGGTG ATAGTCATTG TACCTAAGTA TAAATATTTA 474 0 

GAAATACGAT CATTATTTTT TAAAAATACA AGGATTAAGG CACAAAGGAA TGGTAATAAC 4 8 00 

ATTGGTAAAA TCAATAAGTT ACTTAGCATC ATCTTCCCCC CTTAGGCCTT CAATTTCATC 4 8 60 

TTCTTTTGTT ACTTTATAAG TTCTATAAAC AiSTACAAGT AAAAAC G C AG TCATCCCAAA 4 920 

CCCTATAACT ATTGCAGTTA GTACAATAGC TTGTAACAAG GGATCAACAA ACAATTGGTT 4 980 

TCCACCAGTT ATTAGTGGTT CTGATCTACT AGAACCATAC GTTCCCATAC TCATAATAAT 504 0 

GAGATTACCA GCATGAGTAT AT AT T GAAAT TCCGATTACA ATACGAATTA AATTGATSA 5100 

TAAAATCATA TATGTTCCTA TAAACACTAA AAATCCTATA ACTAGTAATA ATATTAAATT 5160 

CATGATCGAC CTCCGCTAAG CGACAACATC ACTGTGACAA TAACACCAAC AACTGAGAAT 522 0 

AAAAT ACCTA ATTCAAAAAG TGTTATTGTA CTTACATGAA TTTGTCCTAA AATTGGAAGT 52 8 0 

ATCCAAGTTG TTTCATATTG AGACAAAAAT GGTTTTCCAA AAAACATAGG TATTATCGCA 534 0 

GTAATAGATG ATACCAATGC TCCAATAATC ATTAAAATTC TAAAATCAAT CGGTAAACTT 54 00 

TCTAAAACCT CTTCAACATT AAAAGCCAGA AACATTAAAA TAAACGCTGA ACTAAATATT 5 4 60 

AAACCACCAA TAAACCCACC ACCAGGATTA T T AT G AC C T G C G AAG AAG AC ATAGAATCCG 5520 

AAAGTCAATA AAATAAATAC AACAAGTTTC GTGACCGTTC TTAACACGAC ATCATTCTCT 558 0 

TTCATCTTGT CCCCTCCGAT CTTGATAATT TAATAATGtg TAAAT ACCTA GCCCAGTAAT 5640 

AATTAACACT AATCCTTCAA ATAATGTATC TAATGCTCTA AAGTCACCAA GTATCGCATT 5700

TACAATATTT TTACCACCTG TTAGTTTGTC AGCTTTTAAA TAAAAGTCTG ATATTGATGA 57 60 

T AAAC CAT C T GTTTGTTGTG TAATAAAAAT TAATGATACA ACAATAAGTG C C AT C AAG AG 5820 
TGATACAGAA ATTTTAATTA TTTCTCTTTT TTTGTTAGCG TTAGATCTTG GCACGTTTGG 5880

TAATCTTGAA AAACTGACAA TAAATAGTAT CGTCGTTATT GTTTCAACTA CTAGCTGAGT 5 94 0 

CAATGCTAGA TCAGGGGCTT TCATTGCTAT AAAGAATAAG GTCACAACAA ATCCGATGAC 6000 

yCCATTTAAA ATGACCATTG TCAGTCGTTG ACGTATAAAA AT T AAAG AAA TACCAATTGT 6060 

GACAGTTACG ATTGCTAATA TAATTTCTAA TGCCCCAAAT TCAGAAEAT GTAACTGATG 612 0 

TACTTTAGGA AGTCCaATTC GAATATAACC ATATCCAATG ATAATCATAA ATATGCCTAA 618 0 

GGTCATAATA ATGTACTGGT TTAAACGATC TTGCATAACA CGTTTAAATC GCTTCGTAGC 62 4 0 

AAACTTTTCA AAATGTCGAT ATACCATCTC ATAGCTTTTT G AAAC T G AAA TCTGTCTAAT 6300 

TTTACCTGTG AACACTTTTT TCCAATCTAC TTTGATTGCT AG T AC AC T AC CCAATAAAAT 6360 

AATGATGATG GTTAAAAGAA GCGGTATGTT AAATCCATGC CATTGCGAAA CATGTGGTGC 6420 

CAATTGATCA ATTTGATGAT TACCACCTGA TACAGCTCTT AATGCnAGAA CGATAATCCC 6480 

CTTCCCAAAT ATATnTGGTA CAAAAAAGiT TACAGGTACT AGCACCATTA a T AT AAG AG A 654 0 

TGGTAAACTA a AC AAC CAT G GTTCGTGGAT ATTTTTTTTA GTAAAAACCT TAGAATCATA 6600 

TTTTGtCCAA AATACTTCTT TTACCATGTA TAGTGCATAT GTGAATGTAA AAACACTCGC 6660 

AATAACACCA ACAAACACGA TAGCTATCAT T G AAAT C AAA CTAAATTGGG ATAATGTCC 6720 

AGTTTGTGTT AATGCATCTA AAAACATTTC TTTACTTAAA AATCCATTTA AAAATGGTAC 67 8 0 

TCCAGCCATA GATAGAGCCG CTATCGTCAT GACTAGATTC ATTTTAGGAA ATAGTTGACG 68 4 0 

CATTCCACTT AAAATTCGTA TATCCCTTGA ACCTGCTTCA TGATCTAAAA TACCTACTCC 6900 

CATGAAAAGC GCACATTTAA AGATGGCATG ATTCATTAGa TGAAATAGcG CACCArATAA 6960 

TACmAATACA TAAATaGATG CTATTGCGTC TTGTTGGTGT TGAGCATATC CGCCACCTAT 7 020 

ACCCACCATA GCCATAATCA TCCCAAGTTG ACTGATTGTA GAGTACGCTA GGATACCTTT 7 08 0 

TAAATCCCAT TGTTTTAAAG CTGTAATTGA ACCAAAAAC ATTGTTATTA AACCAACAAA 714 0 

CGTAACGATA TATACGTACA TATTGCTAnG ACCTAATAAT GGTGTAAATC GAAGTAATAG 7 2 00 

AAnGATACCA GCTTTTACCA TCGTGGCTGA ATGTAAATAA GCACTTACAG GTGTAGGTGC 72 60 

AGCCATTGCT CTAGGTAGCC AG TAT G AAAT GGAraTTGTG CTGATTTTGT AAATGCACCT 7 320 

AATAAAAACA TAAAAATCAT AGGGATAAAC AATCCATGAT TCTTAATATG ATCTGCTTGT 7 380 

CCTAATATCT CTGTGATGTT ATTCGTTCCT G T CAT GAT AT ACAGCATAAT AAAACCAACT 7440 

AATAACGCCA ATCCACCAAA TACTGTAATC ATAAATGATT GAATCGCACC AAATTGACTG 7 500 

TCACCATTGT TATACCAATA TGAnATCAAT AAAAATGATG ATmCACTCGT TAATTCCCAA 7 5 60 

AAaATGTACA TCmATATCGT ATTGTCTGAT AATACaaTAC CAATCATACT G AAC AT AAAT 7 620 

AACGTTAAAT AAAAATAAAA CCTTGGTAAA TTGTCTTTTC GAGAGGATAA ATATTGAGTT 7 68 0 
GCATAGAAGA ATACTGCAAT TCCAATAAGT GAAATAATAA GASAAACAT TAAACTTAAA 7740

CCATCTAAAC GTAAATCTAA ATTAATATCT AATGTCTTAA TCCATGGAAT AGAGGTAGAA 7 8 00 

AGATATTGCA GTTTAGCTAC TGATGGTATT TGAATCAAAA AATAGATAGA TGAAATTACA 78 60 

GGTGCAACCA ACGCTATGTA CCCGGCATAT TTAGCCAATG CTCTACGTTT AGACATTAGA 7 92 0 

AGTATCATCG CCATAATCAC AAGTATAGCA ATTAATAAAT AAACCAAACT CATTATTAGC 7980 

CTCCTTTGTT TCTATAATTG TAATGAAATA TAAATACTAT GTTCACACTC ATTTTCTAAA 8040 

CCGATAAAAT TTAGTGTTTC AATAGCAGAT TGATGCCCTA AATACTTTTG AATGACTGGT 8100 

AT AAG TAT AC CTTTTTGATA AGCATSATAT GCAAATGTCT TACGCAATGT CGTTAGTCCT 8160 

ACATTATCTA TACCAGCTTC AATTGATGCT TGGTGAATTA TTCGATATGC TTGCTGTCTA 8220 

GATAATACTT GATTTGTTCG TAG T GAT T G A AAAAGAACGT CTTCATTCGA AAGACTCCTG 8280 

TCCTCTATAT ATTGAAGTAG TTCTTTCGAT AATGTTTCTG GTAACCTAATTTTAATCAA 8339 
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

CCCGATTTTT TTACGTAATC TAATACATAC GGCAAAATCA ACTTTAATCA AAAAAGACTC 60 

ATACACAATG CCTTTAAAGC ACATGTATGA GTCCTTTTTA GTAGTTTATA TCAAAAAATA 120

GTTTAATGTA TAAATTAGTT TTTGTTTACA GATGCGTCGT AGATTGATTC TACAGCATCA 180

CCTAAAGCTT TATCGAATTC TTCTTTAGAT TGATCAGCTC TTAAATCACT AGCTAATGCA 240

CGTGAGAAAC TTGCGATAAG TTCAGCGTTA TCTTTAAGTA ATTCATTTGC TTTTTCTCTG 300

CTGTAACCAC CTGATAATAC AACGACACGA ACAACATTAG GATGTTCAGC TAACTCTTTG 360

TATAAGTTTG GTTCAGTAGG AATTGTTAAT TTCAACATTA CTAATTGATC AGCATTTAAG 420

CTATCTAAAC CTTTTTTAAG TTCAGCTTTT AATACTTTTT CAATTTCAGC TTTGTCTTTT 480

GCATTAATAT TAACTTCTGG TTCGATAATT GGAACTAAAC CTTTAGCAAT AATTTGTTTA 540

GCAACTTCAA ATTGTTGTTC AACAACGTCT TTGATACCTT GCTCATTT 588

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2841 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

AT AG AG T n C T GGnACTTACT AT GAC AT AT G GCGCTAGAAT GGCTGAGCCA GGTGAATTTA 60 

CAAAACGTGC CTTTTTAAAT GGTCGTATTG ATTTATCTCA AGCTGAAGCA GTTATGGACT 120 

TTATTCGCTC GAAGACAGAT AGAGCTTCTA AAGTTGCGAT GAATCAAATT GAAGGTCGTC 180

TAAGTGACTT AATCAAAAAA CAACGTCAAT C TAT AT TAG A GATACTCGCT CAAGTGGAAG 24 0 

TGAATATTGA TTATCCTGAA TACGATGATG TTGAAGATGC GACTACTGAA TTTCTTTTAG 30 0 

AGCAGTCTAA AG AAAT C AAA CAGGAAATTA ATCGTTTATT AGATACCGGT GCGCAGGGTA 360 

AAATTATGCG TGAAGGTTTA TCTACAGTTA TGTTGGTAA ACCAAACGTA GGTAAATCAT 420 

CGATGTTAAA TAATTTAATA CAAGATAATA AAGCGATTGT AACTGAGGTA GCAGGTACTA 48 0 

CTAGAGATGT CTTAGAAGAG TACGTCAATG TTCGTGGCGT GCCATTAAGA TTAGTTGATA 54 0 

CTGCTGGTAT ACGTGAGACA GAAGATATAG TAGAGAAGAT TGGTGTTGAA CGCTCTQAA 600 

AGGCTCTTAG CCAAGCAGAC TTAATTTTAT TTGTATTAAA CAATAACGAA GCATTGACwC 660 

AAGAAGATTA CACATTATAT GAAGTGGTTA AAAATGAAGA TGTAATCGTA ATTGTTAATA 720 

AAATGGATTT AGAGCAAAAC ATAGATATTA ATGAAGTTAA AGATATGATA GGTGATACGC 78 0 

CATTAATTCA AACTTCAATG TTAAAACAAG AAGGTATTGA TGaATTAGAA AT AC AAAT T C 84 0 

gAGATTTGTT CTTTGGTGGa G AAG T AC AAA ATCAAGATAT GACTTATGTT TCTAATTCAA 900 

GACATATTTC ATTATTAAAA CAAGCAAGAC AAAC GAT AC A AGATGCGATT GATGCAGCAG 9 60 

AATCTGGTGT GCCTATGGaT ATGGTACAAA TTGATTTAS TAGAACTTGG G AAAT AT TAG 1020 

GAGAAATTAT T GG T GAG AC T GCAAgTGATG AACTCATCGA TCAGTTATTC AGTCAATTCT 108 0 

GCTTAGGTAA ATAGTAATTG AAATAGACGG AATACCGTCT TAAGAAGGCT AG T AAGAT AT 114 0 

CAAATAAGGA GGTTTATATT GTGGTTCAAG AATATGATGT AATCGTTATA GGTGCGGGAC 12 00 

ATGCAGGTGT AGAAGCAGGT TTAGCATCTG CAAGACGTGG TGCTAAAACA TTAATGCTAA 12 60 

CAATAAATTT AGATAATATT GCATTTATGC CATGTAACCC ATCTGTAGGT GGACCAGCTA 132 0 

AAGGTATCGT TGTTCGTGAA ATTGATGCTT TAGGTGGACA AATGGCAAAA ACAATCGATA 138 0 

AAACACACAT TCAAATGAGA ATGTTAAATA CAGGTAAAGG ACCTGCTGTA AGAGCACTAA 14 4 0 

GAGCGCAAgc AG a T AAAG T A CTTTATCAAC AAGAAATGAA ACGCGTGATT G AAG AT G AAG 150 0 

AAAATTTGCA TATAATGCAA GGTATGGTAG ACGAACTTAT TATAGAAGAT AATGAAGTTA 15 60 

AAGGTGTACG TACAAATATT GGTACAGAGT ATTTATCTAA AGCAGXATT ATTACAACGG 162 0 

GAACATTTTT ACGTGGTGAA ATCATTTTAG GTAATATGAA GTATTCAAGT GGACCAAATC 168 0 
ACCAATTACC ATCAATCACA TTATCAGACA ATTTAAGAGA ACTTGGTTTT GATATTGTTC 1740

GTTTTAAAAC AGGTACACCA CCGCGTGTAA ATTCAAAAAC AATTGACTAT TCGAAGACTG 1800 

AAATACAACC AGGTGACGAT GTAGGTCGTG CATTCAGCTT TGAAACAACA GAATATATAT 18 60 

TAGATCAATT GCCATGTTGG CTAACGTATA CTAATGCTGA AACACACAAA GTTATCGATG 1920 

ATAATTTACA TCTATCTGCA ATGTATTCAG GGATGATTAA AGGAACCGGG CCACGTTATT 1980 

GCCCTTCAAT T G AAGAT AAA TTTGTTCGVT TTAATGATAA GCCGCGACAT CAACTTTTCT 204 0 

TAGAGCCTGA AGGTCGTAAT ACAAATGAAG TATATGTGCA AGGATTGTCT ACAAGTCTTC 2100 

CTGAACATGT GCAcGTCAAA TGTTAGAGAC GATACCAGGT CTTGAAAAAG CAGATATGAT 2160 

GCGTGCCGGC TACGCAATTG AATATGATGC GATTGTGCCA ACGCAGTTAT GGCTACACT 2220 

T G AAAC G AAA AT GAT T AAAA ACT TATATAC TGCAGGTCAA ATTAATGGTA CATCTGGTTA 228 0 

TGAAGAAGCA GCAGGACAAG GATTGATGGC AGGTATTAAC GCTGCAGGTA AAGTGTTAAA 2340 

CACAGGCGAA AAGAT ATTAA GTCGTTCAGA TGCATATATT GGTGTCTTAA TCGATGATCT 2 4 00 

TGTAACTAAA GGTACTAATG AACCTTATCG TTTACTAACA TCACGTGCAG AATATCGTTT 24 60 

GTTAcTACGT CATGATAATG CTGATTTGAG ATTGACGGAT ATGGGATATG AACTTGGTAT 2520 

GATTTCTGAA GAAAGATATG CACGTTTTAA TGAAAAACGT CAGCAAATTG ATGCGGAAAT 2580 

TAAGCGTTTA TCAGATATTC G TAT T AAAC C AAAC (AA CAT ACGCAAGCGA TTATTGAACA 2 64 0 

ACATGGTGGT TCTCGCTTAA AAGATGGTAT TTTAGCTATC GAT TT ATT AC G C AG AC C T GA 27 00 

AATGACTTAC GATATAATTT TAGAACTTTT AGAAGAAGAA CAT C AAT T G A ATGCAGATGT 27 60 

TGAAGAACAA G T AG AAAT AC AAACAAAATA TGAAGGTTAT ATCAATAAAT CACTACAACA 2820 

AGTTGAGAAA GTTAAGCGTA T 2841
(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

ATCTAATTTC AAACCCGGTG ATAAATTGCC AAGCGTGACG CAATTAAAAG AACGTTATCA 60 

AGTAAGTAAG AGTACTATCA TTAAAGCATT AGGCTTATTG GAAC AAGAT G GTTTGATCTA 120 

TCAAGCACAA GGCAGTGGTA TTTATGTGAG AAATATTGCT GATGCCAATC GTATCAACGT 18 0 

CTTTAAGACT AATGGTTTCT CTAAAAGTTT AG G T GAAC AC CGAATGACAA GTAAGGTACT 24 0 
TGTTTTTAAG GAGATTGCAA CGCCACCTAA ATCTGTACAA GATGAGCTCC AATTAAATGC 300

AGATGATACC GTCTACTATT TAGAGCGATT AAGATTCGTG GACGATGATG TTTTATGTAT 360 

CGAATATTCT TATTATCATA AAGAAATCGT GAAAT AT T T A AAT GAT GAT A TTGCTAAGGG 42 0 

CTCTATCTTC GACTATTTAG AATCAAACAT GAAACTTCGT ATTGGTTTTT CAGATATTTT 480 

CTTTAATGTA GATCAACTCA CTTCAAGTGA AGCTTCATTA C T AC AAT T G T CTACAGGTGA 54 0 

ACCATGTTTA CGTTACCACC AGACTTTTTA TACAATGACT GGCAAACCCT TTGATTCATC 60 0 

TGACATCGTA TTTCATTATC GTCATGCACA GTTTTATATT CCTAGTAAAA AGTAATAAAT 660 

ACATAAAAAC GTCTATATCC CAGTTATAAA CTGGAGTATA GACGTTTTTT TACGATAATA 720 

ACAATGGCTC AAATTGCTAT TATCTTGCTT AGGTTTTTCG TTTTAGAAGA ATATTGCTAC 78 0 

AAAGACAGGC ACAACTGCTA CAACAACTAC ACCAACTAAC ACTAAAGCTA TACTTGCCAT 84 0 

TGATTCTTCT ACAGGTCCTA ATTCTTTGGC TGGTGCTACA CCTAATGTGT GACCACTTGT 900 

TCCAAGTGCT AATCCTCGGG CAATAGGGTT AGTAATTCGG AAAAGTTTA AGAATTTATT 9 60 

ACCTAGGGCA TAAATAATGA CACCATTTAA AATAACTGCT AATGATGTTA ATTCTTTTAT 102 0 

ACCACCGATA C C AG C T GAT A CTGGTAACGC AATCGCTGTA GTTGCTGCTT GAGGTAACAT 108 0 

TGATAAAATA ACATCATTGG CAAATTGTGC TAACTTCGCA AAAGTTAAAA T AAT T AAT AA 114 0 

CGCTACAACT GTACCGATAC CAATACCTCC GAT G AT AC G A TGCCAATGTT TAACAAGCAC 12 0 0 

TTCACGCTTT TTATATAACG GAAT CGC AAA AC AG AT T G T T GCCGGTTCTA AGAAGAAGTA 12 60 

AATAATGTCT CCACCTATTT TGTAAGTCTT ATACGGAATG CCTGTTAAAT AGAGGAAGGC 132 0 

CACACCAAAT ACCATACTGA CAAATACEGG TGCGAATAAG AAGAAAC GAT TAGTTTTTTC 1380 

AAATAATATG GTCGCTAAGA AAAATGGTAT AACGGATAAC AGTATTCCGA AGTAAGGTGT 14 4 0 

GTTTaGTGCT AAGTGGTTAA TCaTGAGCTT GTGCCTCCTC TATTTTGATC TTTTTTGTGA 1500 

CTTTGTCACC TTTAGATCTC GAAGTAACTT T CAT AAT AAT TTgTGTGACA T&CCAGTAC 15 60 

AAATAAGTAA TAGTATTGTT GAGACGATTA TTAGTCCAAT GAT T AAAAAT GGTGCTTGGC 162 0 

TAATGACACC TAAAGAGTTA ACAACTGAGA TACCGGCTGG TACGAAGAGT AAGCCAATGT 1680

TATTTGTTAG TGTCGTTCCT ACTTTTTCGA CTTCGCCTAA CTTAACAGCA CCAGTACATA 174 0 

AT AAT AC AAA T AATAAT AC T AAACCGATTA CTGATGCAGG CATAGGAATT GGCATAAATG 1800 

ATTCAATTAT TTTCGATACA AAGAGTACTA AAGCAAT T AC AATGACTTGG TGAAAAAAGT 18 60 

GTGCTGGTTT TGATGCGTCT TTTTGTTGTT TCACGACCAT TGCCTCCTAC GTTTGATTTA 1920 

ACTAAAGTAT AGATGGCTCA CTTCGATTTG CGTGiTTTTT AGTCCGAAAT ACAAAATATC 198 0 

ATAGGTAAAA TGCATAAAAA AAAGGAT T AC TGTTAAAGTA ATCCTATCGA CGCTTTAAAA 2 04 0 

TCTTTCATAA ATGAACGTCC AACTTGCATC TTGACACCAT TTGTCAATAT TACCATATAA 210 0 
GTATAGTTAA ACCATTGTTG CACTTCTTTA ATGTGTTTCG TGTTAATAAT ATATGAACGA 2160

TGAATACGTA TAAAATAAGT GGGATTCAAT CGTTTTTCAT AACGATTCAA TGGCTCTGTT 2220 

GTTTCGTATT TATGATTCGT TGTATGTATG GTTGTAATAC CATTATGTGT GCCAATCCCA 2280 

ATAATATTTT GTTGCTTTAA CATGTGAATT TTATCGTCAA TTTCAACAGG TAAGCTTTGA 234 0 

TCAAAATTCG CCGACATATC ATTCGCAATT GCACTTGCGT TATTATCATC TTTGGCTTTA 24 00 

GTCGCACGCA CTTTATTGAC TGCTTGTTCA ATACGTTTTT GACCAAACGG TTTCAAAATA 24 60 

TAGTCTGTCG CATTTAATTC AAATGCCTGT ACTGCGTATT GGTCATGTGC AGTTGCAAAA 2520 

ATAATCGCAG GTGGCTCTTT CATCTTTTGA ATCTTAGCTC (TAATTCGAT CCCATTTTCA 2 58 0 

TCCATTAAAT T G AC AT C T AA AAATATAATG TCATATTGAT T GAT C AG TAG TGCTTCCAAT 2 64 0 

GTTTCTTTTA CATTTTCTGC CTCATTAATT TCTTCAAAAC CACCAATTTC ATT T AAT AAA 27 00 

TATGTTAATT CATTACGTGC TAATGGCTCA TCATCTATGA TTAATGCTTT CATATTTATT 2760

CCTCCTCTTG TCTTTCATAA GGAAGTACAC AC C AAAAAG T GGTACCGCTC GATGTCGATT 2820 

CAAATTGTAA TGCTGCGGAT TTTCCAAATA ATCCTTTTAG GCGTAAGTTT AAATTTTCTA 2880 

AAGCACTACC AGTTCCAGAC TCTGATTCTA CAGATGTnTC TCCCaACAAA TGCATTTTAT 2 94 0 

CTTTAGAAAT ACCCTGACCA TTATCTTGTA CAATAATACG TACATGTGTT GCAGTTTCTT 3000 

TAATCACTGA CACGTCAATA TCGTT 3025
(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

ACAGAATTTC ACAGCATTTT TAGATGAAAA AATAAGCCAG TCATAGCGTT GATTTAACAA 60 

AT G AAT AT C A AAATTTAGTG GCTTTATATC AATAAAGGGT TTGtEAATAA TTGATACTAA 120 

ATCACTTTGC ATGTCATTTG TTTGTGTCAT AACTACAACT GGCTTCATAT TTAAACGTCA 180 

CTCCATTATT TAATGTTGTT CATTTAAGCG TTTTATAATT TCATAAGCAC CTTGCTCTTT 24 0 

TAATTTGTTA CTCACTGTTT TGCCTAACTC AACCGGATCT GTTCCGTTCA TTGTATATTC 300 

AAATCGTTCT TTACCATCTG GGGTCATAAT TAAACCTGTA AATTCGATTT CGTTTTGATC 3 60 

TGAGATTGTA GCATATCCTG CAATTGGCAC CTGACAACTA CCATCCATTT CTGCTAAAAA 420 

CGTTCGTTCA GCAGTCACAC ATTTTGCAAC CTCATCATTA TGTACTTTGC TTAATAATGT 4 80 
TAATAGTTCT TCATCGTCAC TACGACATTC TATCCCTAAA GCACCTkGTC CGATTGCAGG 540

TAACAATGTA TCTCTATCAA GATAAGATGT TnCAATATCA TCTGACCAGC CCATTCTTCT 600 

TAAACCAGCT GCAGCTAAAA TAATCGCATC ATAATCTTCA GTTTGTAACT TTTCTAATCG 6 60 

TGTATCTATA TTACCTCTAA TCCATTTAAT CTCTAAATTA GGATACTTAG BAATATTTG 720 

TGCACCACGA CGTAATGAAC TAGTACCAAT AATACTGCCT TCTGGCAATT GGGATAGTGG 780 

TGTATGTGTT T T AG AAAT AT ACGCATCAAA AGGTAATTCT CTATCAGGGA TACAACCTAA 84 0 

TGTTAAACCT TCCGGAATTA CACTTGGTAC GTCTTTAAGC GAGTGTATTG CCATATCGAT 900 

ATTTTTTTCA AAAAGTTCAT GTTGTATTTC TTTAACAAAT AAGCCTTTGC CTCCGACTTT 960 

AGACAATTGT TTATCTACTA TACGATCGCC TTTCGTGaCA ATTTCTTTAA TTTCAATTTC 102 0 

TAGATTTGGC TCGACAGCTT TTAATTTATC AATAAATTGC TGGCTTTGTG TTAAAGCTAA 108 0 

TTTACyTCTT CTGGAGCCAA CGACTrATTT ACGATGTTC AATTCCTCCT AGGAACGGAT 114 0 

TGCTCTAGAT TATTTTCTCA ATTCACAAAA TGTGTTGCAA AAAATAAATT AATCATATTT 1200 

AAGCAAAATA AAATAATGTT ATAGTATATT AAATATCTTG AATTCAACCA TTTGTTGATT 1260 

CTAAGTAAAA TATAACTTCC ATATAATACT GTAATAATTG AAGAGAG TAT TACCTTCGS 1320 

TCAATGAATA TACGTTCACC AACTGAAATT ACACCCCACT GTGTACCTAA AATAATACTA 138 0 

AATATGAGAA TTATCCACCC ACTTAACGTT GAGTAAAACA CAATTGATTC AAGTGTAGCA 14 4 0 

ACGCTACCAA TTCTAAAGTA TTTTTGATCA AAACGTTTTT CCTTCAAATT ACGGTATTGC 1500 

ATGATATACA GTAATGCATT GACAAAAGCT AAGGCAAAGA AGACATAACT TAACACAGCT 1560 

AGACCGATAT GGACTAACAG TAACTCGTCT ACAACAGCAA TTTTCTGAAC CTTATTAGTA 162 0 

TAATGTGTCG GTTGAAATGT ATTCATCCCT AAnAGTGTTA ACCCTATTAA ATTCCAAGGA 168 0 

AAAACACAG 1689
(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

nTGGnTGGCT TTTCCTATTG GACCAAATGG ACCnTTTACC TGGCCnTTCC CAGGACACCC 60 

CGCTTGTGCC CACATTCCAA TCGGAAAAGG TGTATGTGGT ACAGCCGTTT CAGAACGTCG 12 0 

TACACAAATT GTAGCTGATG TTCATCAATT CGAAGGACAT ATCGCTTGTG ATGCTAATAG 180 

TAAGTCTGAG ATTGTCGTTC CAATTTTCAA AGATGATAAA ATTATCGGCG TCTTAGATAT 24 0 
CGATGCCCCT ATAACGGATC GATTTGATGA CAATGACAAa GAaCATCTTG AaGCAATTGT 300

TAAAATTATT GAAAaGCAAC TCGCATAAAA GGACATCAGC ATTTTCAATA AAGTGTTGAC 3 60 

AGTTAGCAGG AAAATGTTAC AATAATCTTT GTGTGAATTA ACGAAAGTAG CAGTTGTATA 4 20 

TTATTGAGCG CTATGTTGTT CCCAATGCGG ACGTGTCACG TAACTGTCGC TATAAGGTGA 4 80 

AGACACATAA AACAATATAT CTTAGTAAGC ATGCAACACT CTTTTTTGTT TATTCATAAC 540 

AACAAAAAAG AATTAAAGGA GGAGTCTTAT TATGGCTCGA TTfiGAGGTT CAAACTGGAA 600 

AAAATCTCGT CGTTTAGGTA TCTCTTTAAG CGGTACTGGT AAAG AAT TAG AAAAACGTCC 6 60 

TTACGCACCA GGACAACATG GTCCAAACCA ACGTAAAAAA T T AT C AG AAT ATGGTTTACA 720 

ATTACGTGAA AAACAAAAAT TACGTTACTT AT AT GG AAT G AC T GAAAG AC AATTCCGTAA 7 80 

CACATTTGAC ATCGCTGGTA AAAAATTCGG TGTACACGGT GAAAACTTCA TGATCTTATT 84 0 

AGCAAGTCGT T TAG AC GC T G TTGTTTATTC ATTAGGTTTA GCTCGTACTC GTCGTCAAGC 900 

ACGTCAATTA GTTAACCACG GTCATATCTT AG TAG AT GG T AAACGTGTTG ATATTCCATC 9 60 

TTATTCTGTT AAACCTGGTC AAACAVTTTC AGTTCGTGAA AAATCTCAAA AATTAAACAT 1020 

CATCGTTGAA TCAGTTGAAA TCAACAATTT CGTACCTGAG TACTTAAACT TTGATGCTGA 1080 

CAGCTTAACT GGTACTTTCG TACGTTTACC AGAACGTAGC GAATTACCTG CTGAAATTAA 114 0 

CGAACAATTA ATCCGTTGAG TACTACTCAA GATAATACGG TCAATACCAA CACCCACAAT 1200

TGTGGGTGT 1209 
(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

AAATCCCTTt GTtaAAgTsC AAAtTTTTCc AACrgCTTTA AtArGACCCA TATTACCtTC 60 

TTGGATTAAA tCmAGGaATG AcATACCACG ACCaCGTATC TTTTAGCAAT ACTTACAACT 120 

AAACGTAAGT TCGCTTCTGC AAGTCTTGAT TTTGCTACTT CATCACCTTG TTCAATACGT 180 

TTGGCTAATT CGATTTCTTC TTGTGCACTT AATAAGTTAA CACGCCCAAT TTCTTTAAGG 240 

TACATACGAA CTGGGTCATT TATTTTAACA CCTGGAGGGG CACTAAGATC ACTTGGATTC 300 

AGTTTCTCGT CAGTATCTGA ACTATCTTTT TCATTAACTA GTGAAATATC ATTATCATTT 360

AATTGATCAA AGAAATCATC CATTTGATCA GAGTCGATAT CAAAATTCTG AAGTTTTTCA 420

GCAATTTCTT CATGACTTAA ATGACCCTCT TTTTTACCTT TTTCAATTAA TTGCTTCTTA 480

ACATCTTCTA ATGTTAATGT CGGATCAATT GTTTGTTTTT TAATTTTAAC TGTETATCA 540

GACATGAAAC GGCCTCCCGA TTTTAAATAT GAACATTCGA AATTTATTCA ATATTGCTAT 600

TTTAAACGAA ATTCTTAATT AATTCCATCC ATATTTTnAA TTTTATTTTA CAAATTGGGA 660

ACTAAATCCC CAATATTTAT TTTTCAATAG TGGTGGTT 698

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

ACTTGATGAT GTATACAATG TATTTCAAGA ATATTATCAA AAAAC AT C T A ACATTAAGTT 60 

TTGTAGAATT CACAATTCTA GCTATTATCA CTTCTCAAAA T AAAAAC AT C GTTCTTCTTA 120 

AAGATTTAAT TGAAACAATC CACCATAAAT ACCCTCAAAC TGTTAGAGCT CTCAATAATT 180 

TAAAAAAGCA AGGCTATCTA ATAAAAGAAC (ETCAACTGA AGATGAAAGA AAAATTTTAA 24 0 

TTCATATGGA TGACGCGCAG CAAGACCATG CTGAACAATT ATTAGCTCAA GTGAATCAAT 300 

TATTAGCAGA T AAAGAT CAT TTACATCTTG TTTTTGAATA ATATCTCTAT TACGCAAGTG 3 60 

TGCTGTATTC TAAAGTGCAC TTGTGTTTTC TATTTTTTAA TAAAACCTCA GCACAT&TG 4 20 

AACAACTTTC TATTTTCTAT AT CAC T T AAA ACCATTTCCG AAATTAAACC TCAGCACATT 4 80 

CAAAGCCCCA CTTTATTCTT AAAAAT AT T T TTTAACTCAT ATGTATTAAA CCGCTTTCAT 54 0 

TATAAAAAAT ATCTCTATAT TtTATCTGtT TtTATTAATC GAAATAGCGT GATTTTGCGG 600 

TTTTAAGCCT TTTACTTCCT GAATAAATCT TTCAGCAAAA TATTTATTTT ATAAGTTGTA 660 

AAACTTACCT TTAAATTTAA TTATAAATAT AGATTTTAGT ATTGCAATAC ATAATTCGTT 720 

ATATTATGAT GACTTTACAA ATACATACAG GGGGTATTAA TkTGAAAAAG AAAAACATtT 780 

ATTCAATTCG TAAAC TAGGT GTAGGTATtG CATCTGTAS TTTAGGTACA TTACTTATAT 840 

CTGGTGGCGT AACACCTGCT GCAAAtgctG CGCAACACGA TGAAGCTCAA CAAAATGCTT 900 

TTTATCAAGT CTTAAATATG CCTAACTTAA ATGCTGATCA ACGCAATGGT TTTATCCAAA 960 

GCCTTAAAGA TGATCCAAGC CAAAGTGCTA ACGTTTTAGG TGAAGCTCAA AAACTTAATG 1020 

ACTCTCAAGC TCCAAAAGCT GATGCGCAAC AAAATAACTT CAACAAAGAT CAACAAAGCG 108 0 

CCTTCTATGA AATCTTGAAC ATGCCTAACT TAAACGAAGC GCAACGTAAC GGCTTCATTC 114 0 

AAAGTCTTAA AGACGACCCA AGCCAAAGCA CTAACGTTTT AGGTGAAGCT AAAAAATTAA 1200 
ACGAATCTCA AGCACCGAAA GCTGATAACA ATTTCAACAA AGAACAACAA AATGCTTTCT 1260

ATGAAATCTT GAATATGCCT AACTTAAACG AAGAACAACG CAATGGTTTC ATCCAAAGCT 1320 

TAAAAGATGA CCCAAGCCAA AGTGCTAACC TATTGTCAGA AGCTAAAAAG TTAAATGAAT 1380 

CTCAAGCACC GAAAGCGGAT AACAAATTCA ACAAAGAACA ACAAAffiGCT TTCTATGAAA 14 4 0 

TCTTACATTT ACCTAACTTA AACGAAGAAC AACGCAATGG TTTCATCCAA AGCCTAAAAG 1500 

ATGACCCAAG CCAAAGCGCT AACCTTTTAG CAGAAGCTAA AAAGCTAAAT GATGCTCAAG 15 60 

CACCAAAAGC T G AC AAC AAA TTCAACAAAG AACAACAAAA TGCTTTCTAT GAAATTTTAC 1620 

ATTTACCTAA CTTAACTGAA GAACAACGTA ACGGCTTCAT CCAAAGCCTT AAA G AC G AT C 168 0 

CTTCAGTGAG CAAAGAAATT TTAGCAGAAG CTAAAAAGCT AAACGATGCT CAAGCACCAA 17 4 0 

AAGAGGAAGA CAATAACAAG CCTGGCAAAG AAGACAATAA CAAGCCTGGC AAAGAAGACA 18 00 

ACAACAAGCC TGGTAAAGAA GACAACASCA AGCCTGGTAA AGAAGACAAC AACAAGCCTG 18 60 

GCAAAGAAGA CGGCAACAAG CCTGGTAAAG AAGACAACAA AAAACCTGGT AAAGAAGATG 1920 

GCAACAAGCC TGGTAAAGAA GACAACAAAA AACCTGGTAA AGAAGACGGC AACAAGCCTG 198 0 

GCAAAGAAGA TGGCAACAAA CCTGGTAAAG AAGATGGTAA CGGAGTACAT GTGTTAAAC 204 0 

CTGGTGATAC AGTAAATGAC ATTGCAAAAG CAAACGGCAC TACTGCTGAC AAAATTGCTG 2100 

CAGATAACAA ATTAGCTGAT AAAAAC AT G A TCAAACCTGG TCAAGAACTT GTTGTTGATA 2160 

AGAAGCAACC AGCAAACCAT GCAGATGCTA ACAAAGCTCA AGCATTACCA GAAACTGGTG 222 0 

AAGAAAATCC ATTCATCGGT AC AAC T G T AT TTGGTGGATT ATCATTAGCC TTAGGTGCAG 2280 

CGTTATTAGC TGGACGTCGT CGCGAACTAT AAAAACAAAC AATACACAAC GATAGATATC 234 0 

ATTTTATCCA AACCAATTTT AACTTATATA CGTTGATTAA CACATTCTTA TTTGAAATGA 24 00 

T AAGAAT CAT CTAAATGCAC GAGCAACATC TTTTCTTGCT CAGTGCATTT TTTATTTTAC 24 60 

TTACTTTTCT AAACAACTTC TGAAACGCCT C AAC AC T T T C TACTCTGATT AC AT AT AT G A 2520 

CATTTTTAGG CATTAAAAAA T C GAAC TAG A CAAGATGCTC ATTGCATTTC GTACTAGTTC 2580 

GATTCATGAA TAATTAGATT TAAAATGTCA TTTGAATCCA AG T G AC AAC A TTATTTATAT 2 64 0 

TTAGAATATT AACGTTAGTA TAAACGTCCA AACACAAATA AAAGCAACAA ATATAATACT 27 00 

GTATTTTAAC GTCATTTTTA ATAATGCAGA TTCTTCACCA ACTTTTTTAA CAGCTGCAGT 2 7 60 

CGCAATGGCA ATTGATTGTG GTGAAATAAG TTTCGCTGCT ACACCACCTG CAGTGTTAGC 28 20 

TGCCACAAGT AATGAACCGC TTGTTGAAAT TTGTTGTGCC ACTGTCGCTT GAATAGGTGC 28 80 

GAATAAAGTA TTGTTATTTA C AAC T GAAC C AGTCATAAAC ACACCAATCC AACCTAATAC 2 94 0 

TGGAGAGAAT AATGGGAAAA TTGCTCCCGC TTTAGCAATA CCTTGTCCAA TTGCTACAGT 3000 
CAAACCACCG TATGTCATAA CTTTAGCAAT AGCTAGGATA GdGAAATTG TAAGGATCGG 3060
TAACCATAAT TCTTTAATTG CTTCGACCAA TAAAGCACCT GCACTTTTCC ATTTTAACTT 3120 
CGTAATTAAA ATTGTAATAA TTACTGTTAA TAAAATCGCT GTCCCAGTTG CACCAATTAA 318 0 

ATCGAGACGC AACGCAATTC CTTTAGGCGA TAAATCACTC ACAGTATTTG GAATTGGCAA 324 0 

TTTTATTACT AAACTTTCAA GTGCACCTCC AGGTTGGAAT AATTTTTTGA AGAATGGTGC 3300 
ACTCCATACT AATACAAAGG CAGTTAAAAT TACGAACGGA CTCCAAGCAA AGACAATTTC 3360 
TTTAGGCGTT CGTTTTTGAA TTTTATGTTC AGACGCTTCC AATCTGAAAA TGTTTTTCGG 3420 

TTTAAATTTA CGACAAACAA ATGCTAACAC CACCATTGTT GCTAGTGATG GAATAATGTC 34 80 
TGCTAGTTCT GGACCATGGA ATATTGTTAA TAATAATTGT AATCCAGTAT ATGTACCACT 354 0 

CACTGTTAAA ATGACAGGTA AAATTTCTTT AATACCTTTC ATACCATCTA CAATGAATAC 3 600 

TAAAACAAAT GGAATAATAA AGTTTAAAAT TGGAAGTGTT AATGCTGAGTATCTCGCAAC 3660 
ATCTAATGTT GTAACGCCTC CACTTAAGTT AAACGTATCA ATAATACTAA CTGGTAAACC 3720 
AATTGCACCA AAGGCACCCG CCGCACCATT AGCAATTAAA CATAACATCG CTGCTTTTAA 37 80 

TGGTTCAAAT CCAAGTTGAA TTAATAATAC TGCACAAATC GCAATTGGCA CACCAAATCC 38 4 0 

TGCTGCACCT TCTAAAAATG CGTTGAAACA AAATCCAATT AATAATAGTT GGATTCTTTG 3900 
GTCCACTGAA ATACTTGCAA TACTATCTTG AATAATAGAA AATTGTCCTG TTTTAATAGA 3960 
AAC T T TAT AT AACCAAACTG C CAT T AAAAC GATATATCCT ATTGGGAAAA TACCGGCAAC 4 020 

AACGCCTTCT GTAATCGCAC C T GC T GAT AC iCGCGCTGGT AATTCAAATA CAAATAAAGC 4 08 0 
CACAATCAAT GTAACAACCA AAGTTGTCAA TGCTGCATAA ATGCCTTTCA TTTTAAAAAC 4140 
GGTTAAGCAT AATAAAAATA AAATAATAGG TACTGCTGCA ACTAAGGCTG ATAATCCGAC 4 200 

ATTATCGAAT GGATTTACAG TAAGTAGTGT CATAATGACT CCCTCTCTTT ATATAAMTA 4 2 60 
TTTATCATTC TGATTAATCT ACAACCTATT T C AAC T TATA TTTTGCGATG ATCACATATT 4 32 0 

TAAAATGTAA CACTCCTATA TGTGACAGGC AATCGAATTT T T AC AAAAAG TTCACAAAAT 4380 
ATACACAATA TTTAACTATA ATAmATAATA TATCaTntTA ATTATAAATA C TAG AT ATT A 4440 

TTTATAATAA TCTCAGGAAT TCGCTTCAAA ACTGCATCAT GAG AG T T TAT ATTTTTATTG 4 5 00 
AGAATCTCTC ATTTTATGAA TTGTAGGAAG TAAACAAAAT ATGACAAGCG TCAAACCAAT 4 5 60 

GATAATGATA AATATCATAT TAAACCATAG TAAATTGAAT TGATGATGGT GTTGTATTTG 4 620 

CCAAATTTCT AATACTGTGA AGATAGACAT ATAGCTCAA ATCTCTAAAT TTAACGTACT 4 680 
TAAAAATAAT TTAATCGCTC GAATTTCACG TGGTGTATGA TTATAATGTG ACCATTTGGC 4 74 0 

AAATCGTTCA TAG TAT C T AC CTGCAATGAA AAATATAAGC CAAATCACTA TAAATGCGCT 4 800 

ATTAATCAAA AGCAGCACCC ATTTATCAGC AAAATTATCA GCATCCCCTG CTAAATTATA 4 8 60 
ATGAATAGGC ACTTTGGTTG GTAATTTTGG ATAGGTCACT ACTGTATAGC ACATCATAGC 4920

TAAGTAAATA AGTAGACTTA ATATTGTAAA AGACCTGATT TTAGACATTC TATCGCCTcT 4980

TcTTTACATT TTATGTATAA CACTCTGCCT ATTTTACCTT TTAATaCATT ACCCCAAcGA 5040

TtAAaCAATA tGTAaTGATA CTATAATTGC GTCAGGAGTA TCCGCTTGTT AAATGTGCAT 5100

AGCTTATATT TAGCTGTTTA ACATGCCACA TAATGATTCG AATTATT 5147

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

CACTTACTTC CACCATTATC ATAACTTTAA AATGGATATA nTTCATCAAA CATTATCTAA 60 

AGGCGTCGCA CCTACACCAA CACCATCCAA CAATTAACTT SAACTCTGC GATTACTTCT 120 

TCAGCAGCAA CTTTCACnTG CGTAATACAA TCAGGTAGTC CAACCGCTTC AAAAGAT GC A 180 

CCAGTTACTC TAAGTCGTGG ATATGTTTGT TTAATATGTG CTTGAATCTG TCTAATTTGT 24 0 

TGAATATGAC CGACATGGTA CTGTGGCATA CTTTTCGGCA AACGATTGAC AATTGTAAAT 300

TCAGGATCAC CTTTAAATGT CATCATTTGA CTTAAATCTC TACGTACAAT CGATACTAAT 3 60 

TCATTATCTG TATGATCATC AACCACAGTA TCACCTGGTT TACCTACATA CGCACGAATC 4 20 

AAAACCTTAC CTTCCGGTGT AGTAAATGGC CATTTTTTCG ATGTCCAAGT ACATGCGGTA 4 80 

ATGTCTGTAT CACTCGTTCT CGCAATTACG AAGCCAGTAC CATCATGGGT ATTTTCAATG 54 0 

TCTTTTTCAT CAAATGCCAA TACAACAGTT GCAACAGTCG TACTATCCAT CGTTTTAAAG 600 

TAATCAAATG CTGGATCTTG TCCGAACCAA TTTAAAAACA CTTGATGTGG TGTCGTTACT 660 

AATACGCCAT CATACACTTC TTCTAGTTGA TCATTGTAAA CAATTTTAffi TTGTTTTTGA 720 

GATGTAATTA TATCATCCAC TGACGTATTG TAGCGTATTG TCACACCTTT ATTTTTAACA 780 

TCTTGTTCTA ATGCTTCAAT AAATGAGCTT AAACCATGCT TAAATTGTTT GAATTGTCCT 840 

TTCGGTGCGC CAGGATATAA TTGTCTTTGT TTCAGACGCT TATTTTTCTC ATCCTTCATA 900 

CCTTTTATCA GACTTCCGAA TGCCTCTTCT TTTTCTTTAA AATTAGGAAA CGTACTCATC 9 60 

AAACTTAATT TATCAATATC GGTACCATAA ATACCACCCA TTAAAGGCTC TATTAAATTC 1020 

TCAAGTACCT CATTACCTAA TCTTGCTCTG AAAAATGCAC CAACAGAAAT GTCACCATCC 1080 

TGCATTTGAG TAGGTTTTTT TAATAAATCA AACCCTGCTC TTAATTTACC AAGTGGCGAT 1140 
ATTAATTTTG TAGTAACAAA TGGTTTAATA TCTGTTGGAA TACCCATAAT TGAACCACCT 1200

GGAATCGGAT ATAATTTATT TTTCGCAAAA ATATATGATT GTCCAGTCGT ATTTGTAACA 12 60 

ATATCTTGTT CTAATCCAAT ATCTTTCGCT AATTCTGTCA TAATCGTTTT TC 1312 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TTTTACAATA AAAATATGAT ATACTACTTG TCGTATATAA GGAACGGAGG ACAATTTATG 60 

CAT AC AT T T T TAATCGTATT ATTAATCATT GATTGTATTG CATTAATAAC TGTTGTACTA 120 

CTCCAAGAAG GTAAAAGCAG TGGACTTTCA GGTGCCATCA GTGGTGGTGC TGAGCAGTTA 180 

TTCGGTAAAC AAAAACAACG TGGCGTCGAT TTATTCTTAA AT AG AT T AAC AATTATTTTA 24 0 

TCAATATTAT TTTTTGTACT TATGATTTGC ATAAGTTATC TTGGTATGTA AGGTCCGGCG 300 

ATGTAAATGT CGGGCTTTTT TATTTATAAT TAAGAATGTA ATAGTTTAAC AATAAGCTAT 3 60 

GTAAAATATA TAGCCTAGTT AAGTATGCAA AGSAGCGTT AGATTTATGC AGATAAAATT 4 20 

ACCAAAACCT TTCTTTTTTG AGGAAGGTAA ACGTGCCGTG T TAT TACT AC ATGGTTTTAC 4 80 

AGGCAATTCG TCTGATGTTC GTCAATTAGG TCGATTTTTA CAAAAGAAAG GTTATACATC 54 0 

ATATGCACCG CAATATGAAG GCCACGCGGC ACCACCAGAT GAAAT AC T G A AATCTAGTC 600 

TTTCGTTTGG T T T AAA GAT G CGTTAGATGG T TAT GAT TAT CTTGTTGAAC AAGGTTATGA 660 

TGAAATTGTT GTTGCTGGTC TATCATTAGG TGGGGATTTT GCTTTAAAAT TAAGCTTAAA 720 

TAGAGATGTA AAGGGTATTG TAACGATGTG TGCTCCTATG GGTGGCAAAA CTGAAGGTGC 780 

CATTTATGAA GGCTTTTTAG AATATGCACG CAATTTTAAA AAGTATGAAG GTAAAGATCA 84 0 

AGAGACTATT GATAATGAAA TGGATCATTT TAAACCAACT GAAACTTTAA AAGAACTAAG 900 

TGAAGCATTA GATACGATTA AAGAGCAAGT TGATGAAGTG TTGGATCCTA TTTTAGTGAT 960 

TCAAGCAGAA AACGACAATA TGATTGATCC ACAATCCGCAAATTATATAT ATGACCATGT 1020 

AGATTCTGAT GACAAAAATA TCAAGTGGTA CAGTGAATCT GGACATGTTA TTACGATTGA 1080 

TAAAGAGAAA GAACAAGTAT TTGAAGATAT TTATCAATTT T TAG AG T CAT TAGACTGGTC 114 0 

AGAATAAAAA GAGATTTTAA CATTAGAAAG GAGGGGCATA ATGAATTTAA AGCAATCTAT 1200 

AGAAGAGATT ATTAATCAAC CTGAATATGA ACCTATGTCA GTGTCAGATT TTCAAGATGC 12 60 

ATTAGGTTTA AGCAGTGCCG ACTCGTTTAG AGATTTAATT AAGGTGCTTG TGGAGTTAGA 1320 
ACAATCAGGA TTAATCGAAC GTACAAAAAC AGACAGATAC CAAAAAAAGC ATAGTTATAG 1380 

AGGTCAATCA AAATTGATAA AA3GAACGTT AAGTCAAAAT AAAAAAGGCT TTGCATTCTT 14 4 0 

AAGACCTGAA GATGAGGATA TGGAAGATAT ATTTATTCCC CCGACGAAAA TTAATCGTGC 1500 

CTTGGATGGA GAT AC T G T T A TTGTAGAAAT CCATCAATCA AAAGGTGAAC ATAAAGGTAA 15 60 

AATCGAAGGG GAAGTTAAGT CGATTGAGAA GCATTCTGTA ACTCAAGTG TTGGTACGTA 162 0 

TAGTGAAGCT AGACATTTTG GCTTTGTTAT TCCGGATGAT AAACGT AT TA TGCAAGATAT 1680 

TTTCATTCCT AAAGGTCAAA GTTTAGGCGC AGTCGATGGT CATAAGGTAC TTGTACAAAT 17 4 0 

TACTAAGTAT GCTGATGGTT CAGATAATCC AGAAGGACAT ATTTCTGCTA TTTTAGGACA 1800 

TAAAAATGAT CCTGGCGTAG ATATTTTATC TATTATCTAT CAACATGGCA TAGAAATTGA 18 60 

ATTTCCTGAT GAAGTGTTAC AAGAAGCTGA AGCAGTACCT GATCATATTG AAAAT AC T G A 192 0 

AATTAAAGGC CGTCATGATT TACGTGATGA ATTGACAATC ACAATTGATG GTGCTGATGC 198 0 

TAAAGACTTA GATGACGCAA TTAGTGTTAA AAAGTTAGCG AACGGTAATA CGCAATTAAC 2 04 0 

TGTAAGTATT GCTGATGTCA GCTATTATGT AACAGAAGGT TCTGCATTGG ATAAAGAGGC 2100 

ATATGATAGA GCGACAAGTG TATATCTTGT TGACCGTGTA ATTCCAATGA TTCCACATCG 2160 

ATTAAGTAAT GGTATTTGTT CATTGAATCC TAATGTTGAT CGTTTAACTC TAAGEGTCG 2220 

CATGGAAATC GATGCTAGTG GTCGCGTTGT TAAACATGAA ATTTTTGATA GTGTTATACA 228 0 

TTCTGATTAT CGAATGACGT ATGATGCGGT AAATCAGATT ATTACTGAAA AGGATCCTAA 234 0 

CATTCGCGAA CAATATAATG AAATTACGCC TAT GC TAG AT TTAGCACAAG ATTTATCTAA 24 00 

TCGTTTGATT CAAATGAGAA AACGACGTGG TGAAATCGAT TTTGATATTA GTGAAGCAAA 24 60 

AGTATTAGTT AACGAAGACG GTATACCAAC AGATGTTCAA TTAAGACAAC GTGGCGAGGG 2 52 0 

TGAACGTCTA ATTGAATCAT TTATGTTAAT TGCAAATGAA ACAGTTGCTG AACATTTTAG 258 0 

TAAGTTAGAT GTACCTTTTA TTTACCGAGT GCATGAGAA CCTAAATCAG ATCGCTTAAG 2640 

ACAATTCTTT GATTTTATTA CAAACTTTGG CATCATGATT AAGGGTACTG GCGAAGATAT 2700 

TCATCCAACA AC AC T T C AAA AGGTTCAAGA AG AAG T AG AA GGTCGACCTG AACAAATGGT 27 60 

CATTTCAACA ATGATGTTGC GTTCAATGCA ACAAGCGCAT TATGATGATG TGAACTTGGG 2820 

ACATTTTGGC TTATCAGCTG AAT AT TAT AC GCATTTTACA TCACCAATTA GACGTTATCC 28 8 0 

TGATTTAACA GTTCATCGTT TAATCCGTAA GTATTTAATT GAGAAATCAA TGGATAACAA 2 94 0 

AG AAG T G AAG CGTTGGGAAG ACAAATTGCC TGAGTTAGCT GAACATACTT CTAAACGTGA 3000 

ACGTCGTGCT ATTGAGGCAG AAC G T G AT AC TGATGaATTG AAAAAAGCAG AATATATGAT 3 0 60 

TCAACATATT GG T GAT G AAT TTGAAGGTAT TGTCAGCTCA GTAGCTAACT TCGGTATGTT 312 0 
CATTGAATTG CCAAATACGA TAGAAGGTAT GGTTCATATT GCGAATATGA CTGATGATTA 3180 

TTACCGTTTT GAAGAGCGTC AAATGGCATT AATTGGTGAG CGTSAGCTA AAGTATTTAG 324 0 

AATTGGTGAC ACAGTTAAGG TTAAAGTGAC GCATGTTGAT GTAGATGAAC GATTAATTGA 3300 

TTTTCAAATT GTAGGTATGC CTTTACCGAA AAATGATCGA TCACAGCGCC CAGCGCGAGG 33 60 

TAAGACAATT CAAGCCAAAA CGCGTGGTAA AT CAT TAG AT AAATCAAAAT C T GAT GAT AA 34 2 0 

GGGTCGTAAG AAAAAAGGTA AGCAACGTAA AGGTAAAAAC CAACGTAATA ATGATAAATC 34 8 0 

AGGTAATAGT AAGCATAAGC CATTTTATAA AGATAAAAGT GTGAAAAAGA AAGCACGTCG 35 4 0 

TAAGAAAAAA TAAGCAGCAA TGAGGTGAGT ATGAATGGCT AAGAAGAAAT CACCAGGTAC 3600 

ATTAGCGGAA AATCGTAAGG CAAGACATGA TTATAATATT GAAGAT AC G A TTGAAGCGGG 3660 

AATTGTATTG CAAGGCACAG AAATAAAATC AATTCGCCGA GGTAGTGCTA ACCTTAAAGA 3720 

TAGTTATGCG CAAGTTAAAA ACGGTGAAAT GTATTTGAAT AATATGCATA TAGCACCATA 37 8 0 

CGAAGAAGGG AATCGTTTTA ATCACGATCC TCTTCGTTCT CGAAAATTAT JKTTGCACAA 3840 

GCGTGAAATC ATTAAATTGG GTGATCAAAC ACGTGAGATT GGTTATTCGA TTGTGCCGTT 3900 

AAAGCTTTAT TTGAAGCATG GACATTGTAA AGTATTACTT GGTGTtGCAC GAGGTAAGAA 3960 

AAAATATGAT AAACGTCAAG CTTTGAAAGA AAAAGCAGTC AAAC GAG AT G TTGCGCGCGA 4 020 

TATGAAAGCC CGTTATTAAG CGATTTAGTT GCTTAATCGG GCTATATTTG ATATAGTTAT 4 080 

ATGTGCTTTT GTAAATTACA AAAGTATGAT TTGTTTGATT TATTATTTCG GGGACGTTCA 414 0 

TGGATTCGAC AGGGGTCCCC CGAGCTCATT AAGCGTGTCG GAGGGTTGTC TTCGTCATCA 4 200 

ACACACACAG TTTATAATAA CTGGCAAATC AAAAATAAT TTCGCAGTAG CTGCCTAATC 4 2 60 

GCACTCTGCA TCGCCTAACA GCATTTCCTA TGTGCTGTTA ACGCGATTCA ACCTTAATAG 4 320 

GATATGCTAA ACACTGCCGT TTGAAGTCTG TTTAGAAGAA ACTTAATCAA ACTAGCATCA 4380 

TGTTGGTTGT TTATCACTTT TCATGATGCG AAACCTATCG ATAAACTACA CACGTAGA& 4 4 40 

GATGTGTATC AGGACCTTTG GACGCGGGTT CAAATCCCGC CGTCTCCATA TTTGTAGCCT 4 500 

ACAGCCTTTG TGGTTGTGGG CTTTTTTATT TTGTGTTTTT CAGGGGATAA TGCATTGCAG 4560 

AATTTGTTGT GAGTATTGAT ATAGCAGTGT TTGTATAGGT GTTTATTTGA TGGAGGAAAG 4 620 

AGTAATAAGT GATTATGAAT TAGTTTTTGA GATATAAGGG GACAGTGATG TGTGTCAAAT 4 680 

AAGTGTCAAA AAAGTTGGAT TCTGAGTTTT AC AT T C AAC A TTGTTCATGA AGAAACTTCT 4740 

TTATACGCAA AAAATTCTCC ATGTTATATA TGTCAATATA AAAATGTGAA TCGTCTACAC 4 8 00 

TTAATTGGAT AAATGGCTAC TGAAAAAGAA CTTTTCATTT TTGTTACGTC ACTAAGTGGG 4 8 60 

TGTAGTTATA AAGAG AT GAG CCGAGTTTTG ATATTTTCAT TAGAATCAAT ATGCCTATTA 4 92 0 

ACACAATCAG CAATAGTTGA C G AG AC G G AA ATAAAAGAAG TCGTAGTTAA GAAATGCATT 4 980 
TCACAACATA CCATTGTAGC CATTTTTATT GTTTTGGATG ATAAACTCTT TTTGGAATTT 5040 

TTAGTTTTTA TAATTTGCAA CTACACTACT TCTTTTACTA ATATTAATGT CTAAGTAATC 5100 

GATAAAAAAT TTTCCATTGA ATAAATGAGA AGTTAAAAAC TTTACTTAAC CTTTCycATT 5160 

GCATTTTCCT ATTCACGATT TTAAGAACCC AACATACTAC AAACGAATTT TAAAAGGCGA 5220 

GAGTAAAGCT TACTTGTTTA T TAT AC AT AT TTAAAATCCA AGAGTCAGAA CAGACTACTC 5280 

CTCTTTATAA CTATAAAAAA TAGCTATGAA AAAATCTATC GTCATAGATT CCTTCATAGC 534 0 

TAATCTTAGT ATGTTTATTT TTATTTTAGG ATGCTATTTA TCAACTCAAC ATATAACTCA 54 00 

CTATTTTTAT AACCTTCTAA TATATCATTA ACTTGTCTAA TAGGTATTC TGGTACTTCT 54 60 

CTAATGTTTT CCAATTTTGT TTTAAATTGT TTTTTTGTTA TTTGCTCTTT ATTTGTAGCC 5520 

AATTGGAACA AGTAAGAATC TAGCATATTA ATTTCTTTAT AT GAATACAT ATATCTTAAT 5 58 0 

AACACTAAAT CTCTAGTTTT TAAGTTAGGC GCTAGTTCTT CTTGTAATTG TTCTATTGAT 5 64 0 

TGTyTCATTA ATAACAATCT CATTTCTAAT TCTTCATTAT TCATTTTATC ACACTCTTtT 57 00 

TAT AT TAATG CTTGACCAAC TTGGGAAACC CAAAACCCTA TGCTTCTTGC AGTAGAATCT 57 60 

TTAATACCAG TTCCCATCAA TGCTTGTGAA ACTTGACCTT GTACATTTCC CCATGTAGCC 5820 

TCTTCTTGTT TTAATGCATT ATTCAATGCG GGATTTACAA ATTTATCCCA TCTTTTTTTT 5880 

ATGATTTTCC GGCACGGGGA CTGATTTCTT TAACACCATT AAACACAGAT TTTTTATTTT 5 94 0 

TAATCATAGC TTTATAGTAT CATGTTGGCT AAGC T AT AAA TAAGTCAGTT TCTCTAAAAA 6000 

TTAAATAACT GAATGTAAGA CAATCAACAA wCCAAATTTA TACTTCATCT AAACSCTGT 60 60 

GGTCGTCATC TTTTTGCTTT TCTTTTTCTT TCTCTCGTTC TTGTTCTTTT TTGTACTCTT 6120 

CTTCAAATTC TTTTTCTTTC TTTTCTACTT CTTCTCT 6157 



(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

CATTTGTTAT TCTGAGTAGC CAATTTGGCA AAGATGAACA AACGTCTGAA CAAACGTATC 60 

AAGTTGCAGT CGCATTAGAG TTAATTCATA TGGCAACACT TGTTCATGAT GACGTTATTG 12 0 

ATAAAAGCGA CAAGCGTCGA GGCAAGTTAA CCATATCAAA GAAATGGGAT CAGACAACTG 180 

CTATTTTAAC TGGGAATTTT TTATTGGCAT TAGGACTTGA ACACTTAATG GCCGTTAAAG 24 0 
ATAATCGTGT ACATCAATTG ATATCTGAAT CTiTCGTTGA TGTTTGTAGA GGGGAACTTT 300 

TCCAATTTCA AGACCAATTT AACAGTCAAC AGACAATTAT TAATTATTTA CGACGTATCA 3 60 

ATCGCAAAAC AGCACTGTTA ATTCAAATAT CAACTGAAGT TGGTGCAATT ACTTCTCAAT 4 20 

CTGATAAAGA GACTGTACGA AAATTGAAAA TGATTGGTCA TTATATAGGT ATGAGCTTC 480 

AAATCATTGA TGATGTATTA GACTTCACAA GTACCGAAAA GAAATTAGGT AAGCCGGTCG 540 

GAAGTGATTT GCTTAATGGT CATATTACGT TACCGATtTT ATTAGAAATG CGTAAAAATC 600 

CAGACTTCAA ATTGAAAATC GAACAGTTAC GTCGTGATAG TGAACGCAAA GAATTTGAAG 660 

AATGTATCCA AATCATTAGA AAAT C T G AC A G CATC GAT GA GGCTAAGGCA GTAAGTTCGA 720 

AGTATTTAAG TAAAGCyTTG AATTTGATTT CyGaGTTACC aGATGGACaT CCGaGAtCAC 780 

TACyTTTAAG TTTGACGAAA AAAATGGGTT CAAnAAACAC GTAGTATTTA TGnAAAAGTA 84 0 

TTGAAAGCGC TTTACCAACC TGTTAATATA TAATAGTAAT AT AC 884 
(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

AATTTCATCT GCTCGTGCAA AATCTTTGTT TTTCCTTGCT TCATTACGCT CTTCGATTAA 60 

TTTTTCAACA TCTTCATCCA ATAATTCATC TGCATTTTTA GATTTTAACG GTACACCTAA 120 

AACATCGCTG AAAATTTGAT AAACTGCTTT AAATTTATCA ATTACTTCTG TTGATGTTGT 18 0 

GTTCTCTAGT ACATATTTAT TCGCAAGTkT TGCTAAATCA TAC CAAGCTG TAATTGCATT 24 0 

AGCTGTATTA AAAT CAT CAT TCATAACTGT TTCAAAACGA TTTAAAATCG CATCAATTTG 300 

AT C AAT AT AT GTCTGTTGAT TTTCAATATT AGTAGCAATT TGTGCGCGCT CTTCAATTAA 360 

TTGATAACTA TTGCGAATAC GCTCTAGTcC aCTACGTGCT GATTCTACCA ATTCTAGATT 4 20 

AT AG T T AAT T GGGCTTCTAT AATGTACGCT AATCATAAAG AATCTTAGTA CATCTGGATC 4 80 

AACTTCTTTA ATAATATCGT GAACTAAAAT AAAGTTGCCT AG T GAT T T AC TCATTTTTTC 54 0 

ATTATCAATA TTAATGAAAC CAT TATGCAT CCAATAATTA GCAATGGCG CAT GAT TAT G 600 

TGCTTCTGAT TGTGCTATTT CATTTTCATG ATGTGGAAAT TGTAAATCTG AACCACCCGC 660 

ATGTATATCA ATTGTAGGTC CTAGCTCATG AAATGCCATT ACAGAACATT CTATATGCCA 720 

TCCTGGTCTA CCTTCACCAA ATGGGCTATC CCAACTAATC TCGCCAGGTt CGCTTTTTTC 7 80 

CACAATGTAA AATCAAGTGC ATCTTCTTTA TGCTCTCCTG CATCTATACG AGCACCCACT 840 
TTTAAGTCAT CTATGGATTG ATGACTTAAT TTACCATAAC CTTCAAATTT ACGTGTTCTA 900 

AAGTAAACAT CGCCACCACT TTCATATGCA TAACCTTGAT CCACCAAATC TTTAATAAAT 9 60 

TGAATAATGT CATCCATATG GTCCSTTACC CTTGGATTTG AAGTCGCTTT TCTAACATTT 102 0 

AACGCACCAA CATCTTCATG AAAAGCAGCG ATATATTTTT CTGCAATTTC G G G AAC AG AC 10 80 

TGATTTAATT CTTGAGAACG TTTAATTAAT TTATCATCTA CGTCTGTAAA ATTTGATACA 114 0 

TATTCTACAT TATATCCTTG GTATTCAAAG TAACGTCTCA CTACGTCATAATTAATTGCw 1200 

GGTCTTGCGT TACCAATATG AATGTAGTTA TATACAGTAG GACCACATAC ATACATTTTT 12 60 

ACTTTCCCTG GTTCTATAGG CTTGAACACT TCTTTTTGAC GTGTAAGCGT ATTATATAAT 1320 

GTAATCATCT TGAATCTCTC CATTCCTAGT CTTTTCAAGT TGTCGTTCTA AATGCTTAAT 1380 

TTGTTCATAA AT T G GAT C AG GTAGATGGCG ATGATCAAAT GTTTTTCCAA CTCGAACACC 14 4 0 

ATCTTGCTTA ACAATATGTC CTGGTATACC AACAACCGTT GAATAACTTG GAACTGATTG 15 00 

TAAAACAACT GAATTTGCAC CAATATTTAC ATTTGAATTT ATTTTAATAT TTCCTAAAAC 15 60 

TTTCGCACCG GCTGCTATTA AAACATTGTC TCTATATCT GGGTGTCTTT TCCCTCTTTC 1620 

TTTCCCTGTC CCACCAAGTG TCACGCCTTG ATAGATTGTC ACATTATCAC CAATTGTACA 1680 

TGTTTCTCCT AT T AC AAC G C CCATACCATG ATCTATAAAT AGACGCTTTC CAATTTTAGC 17 4 0 

ACCTGGATGG ATTTCTATAC CTGTGAAAAA TCTTGAAATT T GAG AT AT C G CGCGTGCOC 1800 

AACATATTTT TTTTGGTTGT ATAACTTATG TGCAATCAAA TGACTCCAAA CTGCATGTAA 18 60 

ACCTGCATAC GTTGTAATGA CTTCTAATGT TGAACGTGCC GCTGGATCCT GCTCAAATAC 1920 

CATTTTTATA TCGTCTCTCA TTCTTTTTAA CAAGATCATT TCCTCCTCAA TGATTGAACT 1980 

ACGTAAATAC ATAATTGAAG TACCTGCGAA ATTAAATATC AAAAAAGCAC CACTAACATA 2 04 0 

CAAATTGTAT TGTTAGAGGC GCTTCCGCAC GGTTCCACTC TGAATTTAGC GAATAACATT 2100 

AATAATATTG CGGGCGCTTC CAAATTATCA AGGAAACTAA GTCAACTTAA TGCTCATCAC 2160 

TCTCATTATA TATTTAATTC ATTTTACGAA GGTGCATTCATTAATTTCTA CGTTGTACTC 2220 

ACAGCAACCG TACACTCTCT GCATCGTATA AATTTAATTA CTAATCCTTC GTTTTATATA 22 8 0 

AAATACTTTA TTCATATTGT TGTAATTTGT TAATACAACT TTATCATATG TGGTTACGTT 234 0 

ATAAAATTCA AGTATATACT ACCTTGATCT TGTCTATTTC ATTACTTATA TTGTTTTAAA 2400 

CGGTTTAGCA CTTTTTCTTT ACCAAGTACT TCAATTGTAT TTGGTAATTC AGGACCATGC 2 4 60 

ATTTGGCCTG TTACAGCAAC ACGAATAGGC ATAAATAATT GCTTGCCTTT TATTCCTGTT 2520 

TCTTTTTGAA CTTCTTTAAT TGTCTTTTTA ATTTCAGCCG CTTCAAATGG TTCAAGTGCT 2580 

TCTAATTTAC TGAATAAGTG 03 T CAT T AAC TCTGGTACTT GCTCTCCATT AATCACTTGT 2 64 0 



TGTTCTTCTT CACCAAGAGC TGGCATTTCT TTAAAGAACA TTTCTGATAA AGGTACAATT 2700 

TCACCGGCAT AACTCATTTC TTTTTGATAA AGCGCAATTA ATTTGCGTCC CCAAGATAAA 27 60 

TCCTCTTCTG ACGGCACCTC AGGAATCAAA TTTGCTTTAA TTAAATBGG TAATGCTAAT 2820 

TGGAATACTG TTTCAGTATC TTTTTGTTTC ATATATTGGT TATTAACCCA TGCTAATTTT 2880 

TGCTTATCGA AAAATGCTGG TGATTTTGAC AAACGCTTTT CATCAAAGAT TTTGATAAAT 2 94 0 

TCTTCTTTAG AAAAGATTTC TTCTTCACCT TCAGGAGACC AACCTAATAA CGCAATAAAA 3000 

TTAAATAACG CTTCAGGTAA ATAACCTAAG TCACGATATT GCTCAATAAA TTGTAAAATT 3060 

TGCCCATCAC GTTTACTTAA CTTTTTACGT TCTTCATTAA CAATTAATGA CATATGACCA 3120 

AAACGAGGTG GCTCCCAGCC AAATGCTTCA TAAATCATAA TTTGTTTAGG CGTGTTTGAA 318 0 

ATATGATCAT CACCACGAAT TACATCTGffi ATTTGCATGT AATGATCATC TATAGCTACT 32 4 0 

GCAAAATTGT ACGTTGGAAT GCCATCTTTT TTTACGATAA CCCAGTCACC AATACCATTT 3300 

GAATCAAATG AAATATTTCC TTTTACCATA TCATCAAATG AATACGTTTG GTTTTGAGGT 3360 

ACTCGGAAAC GAATTGATGG TTGGCGTCCT TCTGCTTCAA ATTGTTGACG TTGTCTTCA 3420 

GTCAAATGCG CATGTTGACC ACCATAGCGA GGCATTTCAC CACGAGCGAT TTGCGCTTCA 34 8 0 

CGTTCAGCTT CTAATTCTTC TTCTGTCATA TAGCATTTAT ATGCTTTATC TTCTGCTAGT 354 0 

AACTGATCTA TTAATGGTTG GTAGATATGT TGACGTTCAG ATTGACGATA TGGTCCGTAG 3 60 0 

CCATTGTCTT TATCTACAGA CTCATCCCAA TCTAATCCTA ACCATTTAAG ATTATCAAAT 3 660 

TGTGATGTTT CTCCATCTTC TAAATTACGT TTTTTATCAG TATCTTCAAT TCGAATCACA 3720 

AAATCTCCGT TGTAATGTTT AGCATACAAG TAATTGAATA ATGCTGTTCT TGCATTACCA 37 8 0 

ATATGAAGAT ACCCAGTTGG ACTTGGTGCA TATCTACTC TTATACGATC GCTCATTTTT 38 4 0 

TTCACTCCTA AATTAAATAT CAGATTTTCA AGTTAGTTCA TATAAATTGT TCATTTGCTA 3900 

TCTTCGACCG TCATAACAAA TGTCTAACTC GTCTTATTGT TAAAACGAAA CAATGCTTTT 3960 

TAACATGACC TTAAAATAAT TTCATTGTTT AATCATAACA TAATTCCCTG GGTAATATGC 4 020 

TTAAATTTTA AATAGAAAGC TGTTGTTTTT TCAACACTTT AAAAAAGCTA TCCCTAAGAA 4 08 0 

TCaAGACTGC ACTACATAAA CATTTCAACG AC T T AAAAAT GAATATTATG TACTGTGAAA 414 0 

TTAAACTTCA AATTAACTAT TCAAATACGT TAAAATTGAT TCTAATTTTG TATGTCTTGA 4200 

TTGCTATAAG AATAACTTTA TTAATATCTA AAATTTAACA CTTAATGAAC TTGTTTCAAT 4 2 60 

GATATATTAG CACTATTTGT ATTTTTTGAT AACTAATATG TTTTGCATTT ATTTATAGTT 4 32 0 

ATACTTCAAA TTACAAACTt CGCCATTTCA TATACCTTTT AATATCTATT TTGTTTTCGT 4 38 0 

CAACTACAGT TTTTATAATG ATACTGTATC TTCGATTTTT TTBCAAAAA CAATTCTTCC 4 4 40 

TGAAGATGTT TGCAATAAGC TGACTACTTC TAAATTGACA TGACTGCCAA TAAGATTTTT 4 50 0 
AGCATTATCA ACAACTACCA TCGTACCATC ATCTAGATAT CCTACTGCCT GACCAGGCtC 4560 

CTTACCCATT TTTGTCAGTA AAATATGCAG TTGATCACCT TGATGTACAT TAGGTTTGAT 4 62 0 

TGCTTCTGAT AAATCATTAA CATTTAATGC TTTGATACCA TGTACATGAC AAACTTTATT 4 680 

TAGGTTGAAA TCTGTCGTTA TAATACTTGC ATGATATTGT TTTGCTAATT TTAATAACAT 4 74 0 

CGTATCAATA TCACTATGTG TTTTAGTTGG ATGTATAACC TTTGTAGGAT AGTCTAAATC 4 8 00 

ATACAATTCA TTTAAAATAT CTAAGDCTCT TTTACCCTTT TCaCGTTTAA CACTGTCATT 4 8 60 

TGAATCTGCA ACAATTTGTA ATTCATTAAT AACACCTTGT GGAATTAAAA TATTGCCATC 4 920 

GATAAAACCG CAACGAATGA CTTCTAAAAT ACGACCATCA ATAATTGCGC TTGTGTCGAT 4 980 

AATTTTTGGC GTAgcaCTTT TaGTATGTTG TGACATGGAA CGCGCTATATTCTCAGGTAA 5040 

AAACATTAAC ATTTCATCTC GTTTTTTAAG GCCAAATTGG AAACCGAAAT AAC AT AG T AA 5100 

TATCGTAATT AT G AC AG G AA TGAAATGATT AAAAATAGAG TTGCCAATTG ATTCTAATAT 5160 

AAACGACACC ATAACAGAAA TAAGTAATCC GAT TAT T AAA CCTATTGTTG CGAATAGTAT 522 0 

TTCAACAGCA CTTCTACGCA TAATAAAATG TTCTAAACCT TTTATAGCGT TAGTAACTCG 52 8 0 

TCTAATAAAT ACACCAAAAA TTAAGAACAT AAAAATACTA CCGATAATGC CATCTACATA 534 0 

GTGATTTTTT AAAAAGCTGG AGTTTTGTAA TCCAAGATCA TTTGCAATTT CAGGAATAAT 5 4 00 

AATTATTCCT AATGCGCTCC CAATAATTAA GHAATAATA ATAACCATTA GTTTAACGAT 54 60 

ATTCACACAA TGTCCTCCTT TCTTGATGTT TTATGAATGA AGAGCAAATG ACAATACTTC 5 52 0 

ATGTACAGTA GTTACACCTA TTACTTGTAT ACCTTCAGGA TATGTCCATC CGCCTATATT 5580 

ATTTTTAGGA ATAATTACAC GTTTGAAACC TAGTTTTGCA GCCTCTTGCA CGCGTTGIC 564 0 

TATCCGAGAT ACACGACGTA CCTCACCCGT TAAACCAACT TCTCCAATAT AGCAATCTAA 57 00 

TCCGTCGACA GCTTTATCTT TAAAGCTAGA TGCAGTTGCT ACAATTACAC TTAAATCAAC 5 7 60 

TGCTGGCTCC GTTAACTTTA CACCGCCAGC TACTTTGATA TAAGCATCTT GTTGTTGTAA 5820 

TAGATAATTT TCTTTCTTTT CCAAAACAGC CAT C AAC AAA CTTAATCGAT TATGATCAAT 58 80 

CCCTGTTGCC ATTCGTCTCG GAT T GT T AAA AGTCGTTGGA GTTACCAGCG CTTGAACTTC 5 94 0 

TATTAAAAGT GGTCTGGTTC CCTCCATGGT TGCAACAATT GTTGAACCTG GAACATTTGT 6000 

TGAACGTTCT TCTAAAAACA TTTCAGATGG ATTATTTACACCTTTTAATC CACTTTGCTT 60 60 

CATTTCGAAG ATTCcCATTT CATTCGTTGA ACCAAAACGG TTTTTAACAG CTCGCAAAAT 6120 

TCGATATGCG TGGTGTTCAT CGCCTTCAAA ATAAAGCACA GTATCaACCA TGTGTTCTAG 6180 

CAATCTTGGG cCCAGCAATT TGACCTTCTT TCGTTACATG ACCCACTATA AAAGTTGCaA 6240 

TGTTCATTTG TTTAGCAATA TTCATTAAAC TTTGTGTACT TTCACGAACT TGTGAAACAG 6300 
AACCTGGCGC AGAGCTGATT TCAGGATGAT ATATTGTTTG AATCGAATCC ACTACTAATA 6360 

AATCAGGTTG TTCTTCTTTT ACTGTTTGAT AAATAACTTC AAGATCTGTT TCAGCTAATA 64 20 

CTTGCAATTC ACTTGAATCT TCATCTAATC GCTCTGCACG TAATTTAGTC TGACTAAGCG 64 80 

ATTCTTCTCC AGTAATATAT AGTACTTTTT TCTTTTGAGA TAACGATGCA CAAATTTGTA 65 4 0 

AAAGTAACGT TGACTTACCA ATACCTGGAT CCCCACCAAT AAGTACTAAC GATCCGCTCA 6600 

CAATACCTCC ACCTAATACA CGGTTGAATT CTGCTGAATC TGTTAAECT CTCGGCGTTG 6660 

TTTCATGTTT AATACTATTT AATTTTTGTA CTTTACCTGC TAATTCCTTG GTTTTAACTC 67 2 0 

CATGTTTAGG ATTGGCTGCT TTTTCAACAA TTTCCTCCAT TTGATTCCAA GCGCCACAAT 67 8 0 

TAGGACATTT CCCCATCCAT TTAGGAGATT GATAACCACA AGCCATACAT TCAAAAATCA 68 4 0 

CTTTTTTCTT GGCCArAATT GCAcCTCCAC TTTCTT 687 6 
(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

CAACTCAAAC AGCAGAACAA CGTCGTGAGT TGATTAATGG TGTATTTACT GACATTAATC 60 

CCATACATTA AAAATATGAT GTACGTGTTA GCAGATAATA GACATATCTC ATTAATAGCT 120 

GACGTATTCA AGGCGTTCCA AAGCT TAT AT AAC G G AC AC T ACAATCAAGA TTTTGCAACA 180 

AT T G AG T C AA CATATGAATT GAGTCAAGAA GAGTTAGATA AGATTGTCAA ACTAGTAACT 24 0 

CAACAAACGA AGTTATCTAA AGTTATTGTA GATACAAAAA TTAATCCAGA TTTAATTGGT 30 0 

GGATTTAGAG TTAAAGTCGG CACAACTGTA TTAGATGGTA GTGTTAGAAATGATCTTGTC 3 60 

C AAT T AC AAA G AAAAT T TAG AAGAGTTAAT T AAT TAT AAA GAGGAGTGAC ATAGATGGCC 4 20 

ATAAAAGCTG AAGAAATCAG TGCATTACTT CGCTCACAAA TTGAAAATTA TGAGTCAGAA 4 80 

ATGTCCGTAA CTGATGTAGG TACTGTATTA CAAATTGGTG ATGGTATTGC ATTAATTCAC 54 0 

GGATTAAATG ACGTTATGGC TGGTGAGCTA GTAGAATTCC ATAACGGCGT ACTTGGTTTA 600 

GCCCAAAACC TTGAAGAGTC AAACGTGGGT GTGGTTATTT TAGGACCATA CACAGGTATT 660 

ACTGAAGGTG ACGAAGT T AA ACGTACTGGT CGTATCATGG AAGTACCAGT AGGTGAAGAA 72 0 

CTAATCGGAA GAGTTGTTAA TCCATTAGGA G\ACCTATTG ATGGAcAAGG ACCGATTAAC 780 

ACAACTAAAA CACGTCCaGT AGAGAAAAAA GCTACTGGTG TAATGGATCg TAAATCAGTA 84 0 

GATGAGCCAT TACAAACAGG TAT C a AAGC A ATTGATGCTT TAGTACCAAT TGGTAGAGGT 900 
CAACGTGAGT TAATCATCGG TGACCGTCAA ACAGGTAAAA CAACAATTGC AATTGASCA 960 

ATTTTGAACC AAAAAG AT C A AGGTACGATT TGTATCTATG TTGCTATTGG T C AAAAAG AT 1020 

TCAACAGTAA GAGCAAATGT TGAAAAGTTA AG AC AAG C AG GCGCTTTAGA C T AC AC TAT T 108 0 

GTTGTAGCAG CATCAGCTTC TGAACCTTCT CCATTATTAT ATATTGCACC ATATTCAGGT 114 0 

GTAACAATGG GTGAAGAATT CATGTTTAAC GGTAAACATG TTTTAATCGT TTA 1193 



(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

TGCTAAGAAG TCAAAATAAA CTAACTATnA AACATCTAGT ACGATTATTA AAGTGACAGA 60 

TnATAAAATT GAATTATTnA GAGAAGGAGA TATAAAGTTT GAAGAAATAA AAG AAAG AC T 12 0 

AGGTACAGGT ATTATTTATG AATAAGTTAA TACTGGGAT T TAT T TAT AC CGAATTTTTT 180 

CACGAGCATA CTTTTATTTA CCGTTTTTAT TAATTTACTT TTTGATTCAA GGTTATTCCA 24 0 

TAATACAATT AGAAATATTA ATGGCGTCTT ATGGCATTGC AGCATTTTTA TTCTCTCTAT 300 

ACAAAGAGAA GTGTTTTAAA ATTTGTAACT TAAAAGATTC TAATAAATTA GTTGTTAGTG 3 60 

AAATATTCAA AATCATCGGT TTATTGTTGT TATTATATCA AAATCAATAT TTAATTTTAG 420 

TAGTGGCACA AATATTATTA GGGTTAAGTT ACTCAATGAT GGCGGGTGTT GATACCGCAA 4 80 

TAATTAAAAG AAATATAACA AATGAGAAAT ACGTACAAAA TAAGTCAAAT AGCTATATGT 54 0 

TCCTATCATT ATTAATTTCA GGGATTATAG GTAGTTATCT TTATGGAATA AAT AT TAAAT 600 

GGCCTATAAT AATGACTGGT ATATTTTCAA TTCTAACAAT T AT AAT TAT T CGATGCACAT 6 60 

TAG T T G AAAA TAGGGAATTA AAT T T AAT AG GAGAAACAAA GGGAAAGATA AAGAAATTTC 720 

TACCAGAAGA GAAGTTTTGG ATATTGCATT ATTCTTTTTT AGAGCGTTA ATATTAGGAT 7 80 

TTTTTATAGG ATTTATTCCA ATTAATATAT ATAATGATTT AAAACTGAAT AAT TT AC AAT 84 0 

TTATTTCAGT ATTAACTTGT T AC AC AG TTA TGGGTTTTGT ATCTTCACGT TATTTAACTA 900 

AATACTTGAA TTATAAGTTT GTGTCAGAAA TTTGTTTAGT AATATTTTTA ATAATATATA 960 

CAT ATC AAAG TTTCATAGCA GTTACTATTT C T AT GAT AT T TTTAGGTATT TCTTCAGGGT 102 0 

TAACTCGTCC ACAAACTATA AATAAACTTT CTAGCAGTAG T AAC T T AAG A GTGATGCTTA 1080 

AT TAT GC AG A AACGTTATAT TTTATTTTTA ATATCGCATT TTTACTTATG GGTGGTTACT 114 0 
TATATACAAT AGGAACTATT CAATACTTAA TATTATTTAT TTCGTTATTA ATTTTTATAT 1200 
ATTTAATAAT AATATTTyAT TTTACAAGGA GAGAGCAACA TGAAAATAAA AACTGAATTT 1260 
AAAGGGAACA ATATACCATA TGAATACGCA GCAGGTGCAG ATGTGAGTGA TTCTATTAAC 132 0 

GGGAATCCAA TTAAGTCATT TCCATTTGAA GTAATTGAAT TACCGGAAG g AC T AAAT AT 1380 
CTTGCTTGGT CTTTAATTGA CTATGATGCA ATTCCTGTAT GTGGCTTTGC TTGGATTCAT 14 4 0 

T GG AG T G TAG CTAATGTAAG TGTTAGTGGC AATTCAATTT CTATAAAAGC AG AT T T AT C A 1500 
AGAACAAAGG GCGACTATGT ACAAGGTAAA AATAGCTTTA CTAGTGGGTT GTTGGCTGAA 15 60 

GATTTTTCAG AAATAGAAAA TCACTATGTA GGACCTACAC CACCTGATCA AG AT CAT C AA 1620 
TATGAATTAA CAGTTTATGC GTTAGATCAT TCTTTAAATT TGAAGAATGG GTTCTACTTG 168 0 

AATGAATTTT TAAAAGAAGT AAATCAACAT AAAATTGATC AAACAAGTAT TAACCTTATA 17 4 0 

GGAAGAAAAA TTTAATACTA AATATCTCAT CAATATAAAA TTGTTCAATT AAAAGTACAA 1800 
AGAAACAAAG GTTTTAATTT ATATATTAGG TACGGCGTTC GCTATAATGC AAAGAAGTAA 18 60 

TTAAATTTAA GAAATGTAAA CTTAGTTATT GTAATGTGAA TTTATTTGAA AAAATAGAAA 1920 

GTATTAACAA TTATAGCTTT TACATTAATT AAAATTTATT TTTAAAAACA AGTAAEAAT 1980 
T T AC AT AC T T ATAATTTTTG AAAATTTTCA ATTTGTGTTA TATTGATTTT GTAAGATACT 204 0 

TTAACTCACA AAGGAGAGAG AGTATATGAA AT T AAAAT C A TTTATAACTG TAACTTTGGC 2100 
ACTGGGCATG ATCGCAACGA CTGGCGCTAC TGTGGCAGGT AATGAGGTAT CTGCAGCAGA 2160 
AAAG G AC AAA CTACCGGCAA CTCAAAAAGC TAAAGAAATG CAAAATGTTC CATATACAAT 2220 
TGCAGTAGAT GGCATTATGG CTTTCAATCA ATCTTACTTA AATTTACCAA AAGATAGCCA 2280 
ATTATCATAT T TAG AT T TAG GAAATAAAGT TAAAGCTTTG TTATATGATG AACGCGGTGT 2340 

AACACCTGAG AAGATTCGAA ATGCAAAATC TGCCGTTAC ACGATTACTT GGAAAGATGG 24 00 
TAG T AAAAAA GAAGTGGATC TTAAGAAAGA TAGCTACACA GCAAACTTGT TTGATTCAAA 2 4 60 

TTCAATTAAA CAAATTGATA TTAATGTAAA AAC T AAAT AA ACACTTAAAA T TAT C AAAT T 2520 

CAACATGAAG CATTTAATTT T AC AG T GAT G AT TAT AAAAT AATTGCCTTG ATACAAAGAT 2 580 
TACTCGTAAA TGACATCTTT GTATTAAGGC TTTTTCTAAA TTTAAAAGTG ATGGGTTAGA 2 64 0 

GGTCATTGAG CTTTAAAATA T T C AAAAT AC AAAAC AT T AA TGGCCAAAAA TAAAAGCCGC 2700 
CTTTATCTGG GCAGCTTCAA T AA T AAG AAA GACATATTTC ATTTTATACT AAATAGTTAT 27 60 

TGT GATGAAT CTTTCGGCGG TTTAATTACT GCAGCAAAAA TTGCTGTGAA AATCGTGAAC 2820 
AATACTGCCA TGATAATTGG ATTCACTACA TTTAAGCTGT CTCCACCTAC TAGGCTATTA 2880 
AGTACAAAGT TAACCATTTG CAT T AAT AAT AATGCCCAAA AGAATGT T AC GAGGTGTTTC 2 94 0 

ATGTCATTCT ACCTCCACTT TAATTATATA TATTTTATTT T AAG B AAAG TTAGAAATTT 3000 
GTATAGTAAC ATCTCATATA TTTTGACCAT ATTATACAGT TTAAATAAAT GATTTTATCT 3060 

GAATGGCTAT TCTAAATTAA GCGCATTAAA ACCAATTTCA TACTGAAATT TGACGATAAT 312 0 

AAAGCAT TAA AATTTTATTA ACTAGTCAAT ATTCCTACCT CTGACTTGAG TTTAAAAAGT 3180 

AATCTATGTT AAATTAATAC CTGGTATTAA AAATTTTATT AAGAAGGTGT TCAACTATGA 32 4 0 

ACGTGGGTAT TAAAGGTTTT GGTGCATATG CGC CAGAAAA GAT TAT T G AC AATGCCTATT 3300 

TTGAGCAATT TTTAGATACA TCTGATGAAT GGATTTCTAA GATGACTGGA AT T AAAG AAA 33 60 

GACATTGGGC AGATGATGAT CAAGATA^TT CAGATTTAGC ATATGAAGCA AGTTTAAAAG 34 20 

CAATCGCTGA CGCTGGTATT CAGCCCGAAG ATATAGATAT GATAATTGTT GCCACAGCAa 34 8 0 

CTGGaGATAT GCCATTTCCA ACTGTCGCAA ATATGTTGCA AGAACGTTTA GGGACGGGCA 35 4 0 

AAGTTGCCTC TATGGATCAA CTTGCAGCAT GTTCTGGATT TATGTATTCA AdATTACAG 3600 

CTAAACAATA TGTTCAATCT GGAGATTATC AT AAC AT T T T AGTTGTCGGT GCAGATAAAT 3 6 60 

TATCTAAAAT AACAGATTTA ACTGACCGTT CTACTGCAGT TCTATTTGGA GATGGTGCAG 3720 

GTGCGGTTAT CATCGGTGAA GTTTCAGATG GCAGAGGTAT TATAAGTTAT GAAATGGGTT 37 80 

CTGATGGCAC AGGTGGTAAA CATTTATATT TAGATAAAGA TACTGGTAAA CTGAAAATGA 38 4 0 

ATGGTCGAGA AG TAT T T AAA TTTGCTGTTA GAATTATGGG TGATGCATCA AC AC G T G TAG 3 9 00 

TTGAAAAAGC GAATTTAACA TCAGATGATA TAGATTTATT TATTCCTCAT CAAGCTAATA 3 9 60 

TTAGAATTAT GGAATCAGCT AGAGAACGCT TAGffATTTC AAAAGACAAA ATGAGTGTTT 4020 

CTGTAAATAA ATATGGAAAT ACTTCAGCTG CGTCAATACC TTTAAGTATC GATCAAGAAT 4 0 80 

TAAAAAATGG TAAAATCAAA GATGATGATA CAATTGTTCT TGTCGGATTC GGTGGCGGCC 414 0 

TAACTTGGGG CGCAATGACA ATAAAATGGG GAAAATAGGA GGATAACGAA TGAGTCAAAA 4 2 00 

TAAAAGAGTA GTTATTACAG GTATGGGAGC CCTTTCTCCA ATCGGTAATG ATGTCAAAAC 4 2 60 

AACATGGGAG AATGCTCTAA AAGGCGTAAA TGGTATCGAT AAAAT T AC AC G TAT C GAT AC 4 32 0 

T G AAC C T TAT AGCGTTCACT TAGCAGGAGA ACTTAAAAAC TTTAATATTG AAG AT CAT AT 4 380 

CGACAAAAAA GAAGCG3GTC GTATGGATAG ATTTACTCAA TATGCAATTG TAGCAGCTAG 4 440 

AGAGGCTGTT AAAGATGCGC AAT TAG AT AT C AA T G AAAAT ACTGCAGATC GAATCGGTGT 4 500 

ATGGATTGGT TCTGGTATCG GTGGTATGGA AAC AT T T G AA ATTGCACATA AAC AAT T AAT 4 5 60 

GGATAAAGGC CCAAGACGTG TGAGTCCATT TTTCGTACCA ffiGTTAATTC CTGATATGGC 4 620 

AACTGGGCAA GTATCAATTG ACTTAGGTGC AAAAG G AC C A AATGGTGCAA CAGTTACAGC 4 680 

ATGTGCAACA GGTACAAATT CAAT CGGAGA AGCATTTAAA ATTGTGCAAC GCGGTGATGC 4740 

AGATGCAATG ATTACTGGTG GTACAGAAGC ACCAATTACT CATATGGCAA TTGCTGGTTT 4800 

CAGTGCAAGT CGAGCGCTTT CTACAAATGA TGACATTGAA ACAGCATGTC GTCCATTCCA 4860 

AGAAGGTAGA GATGGTTTTG TTATGGGTGA AGGTGCTGGT ATTTTAGTAA TTGAATCTTT 4 92 0 

AGAATCAGCA CAAGCTCGAG GTGCCAATAT TTATGCTGAG ATAGTTGGCT ATGGTACTAC 4 980 

AGGTGATGCT TATCATATTA CAGCGCCAGC TCCAGAAGGT GAAGGTGGTT CTAGAGCAAT 50 4 0 

GCAAGCAGCT ATGGATGATG CTGGTATTGA ACCTAAAGAT GTACAATACT TAAATGCCCA 5100 

TGGTACAAGT ACTCCTGTTG GTGACTTAAA TGAAGTTAAA GCTATTAAAA ATACATTTGG 5160 

TGAAGCAGCT AAACACTTAA AAGTTAGCTC AACAAAATCA ATGACTGGd ACTTACTTGG 522 0 

TGCAACAGGT GGAATTGAAG CAATCTTCTC AGCGCTTTCA ATTAAAGACT CTAAAGTCGC 5280 

ACCGACAATT CATGCGGTAA CACCAGATCC AGAATGTGAT TTGGATATTG TTCCAAATGA 534 0 

AGCGCAAGAC CTTGATATTA CTTATGCAAT GAGTAATAGC TTAGGATTCG G T G G AC AT AA 5 4 00 

CGCAGTATTA GTATTCAAGA AATTTGAAGC ATAACTATAA nAATCTTCAG TAACGTTGTT 54 60 

TTAGTTACTG AAGATTTTTT CaGTTTCTTT ATACTAAGAT GAGCGACAcA CAATCGTCAT 5520 

AATAAAATAT GAATATTTAT TAATAATAA 554 9 
(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

AGATTATAGT AAGAT TGATA GTTTGGTGAC TGaAGCgCGa GaAAAATTAT CAGaAGTAAA 60 

mCCTTTAAAT ATTGCACAAG CTTCTAGAAT ATCAGGGGTA AATCCAGCAG ACATATCTAT 120 

AT TAT T GAT T TACTTAGAAC AAGGTAAACT CCAAAGGGTG AGTGACTAAT GACTGTAGAA 180 

TGGTTAGCAG AAC AAT T AAA AGAACATAAT ATTCAATTAA C T GAG AC T C A AAACAACAG 24 0 

TTTCAAACAT ATTATCGTTT ACTTGTTGAA TGGAATGAAA AGATGAATTT GACAAGTATT 300 

ACAGATGAAC ACGATGTATA TTTGAAACAT TTTTATGATT CCATTGCACC TAGTTTTTAT 360 

TTTGATTTTA ATCAGCCTAT AAGTATATGT GATGTAGGCG CTGGAGCTGG TTTTCCAAGT 4 20 

ATTCCGTTAA AAATAATGTT TCCGCAGTTA AAAGTGACGA TTGTTGATTC ATTAAATAAG 4 80 

CGTATTCAAT TTTTAAACCA TTTAGCGTCA GAATTACAAT TACAGGATGT CAGCTTTATA 54 0 

CACGATAGAG CAGAAACATT TGGTAAGGGT GTCTACAGGG AGTCTTATGA TGTTGTTACT 600 

GCAAGAGCAg TAGCTAGATT ATCCGTGTTA AGEAATTGT GTTTACCGCT AGTTAAAAAA 6 60 

GGTGGACAGT TTGTTGCATT AAAATCTTCA AAAGGTGAAG AAG AAT TAG A AGAAGCAAAA 720 
TTTGCAATTA GTGTGTTAGG TGGTAATGTT ACAGAAACAC ATACCTTTGA ATTGCCAGAA 780 

GATGCTGGAG AGCGCCAGAT GTTCATTATT GATAAAAAAA GACAGACGCC GAAAAAGTA 84 0 

CCAAGAAAAC CAGGGACGCC TAATAAGACT CCTTTACTTG AAAAATAATG CATAATCCTT 900 

TACAACTAAC ATAAAAGGAG CGAATGGATA ATGAAAAAAC CTTTTTCAAA ATTATTTGGT 9 60 

TTGAAAAACA AAG AT GAC AT CATTGGACAT ATTGAAGAAG ATCGCAATAG TAATGTTGAA 1020 

TCCATTCAAA TTGAACGTAT CGTTCCCAAC CGTTATCAAC CAAGACAGGT GTTTGAACCA 1080 

AATAAAATTA AAGAACTTGC TGAATCAATA CATGAACATG GTTTACTACA ACCTATTGTT 114 0 

GTAAGACCGA T T G AAG AAG A TATGTTTGAA ATTATTGCTG GAGAGCGCCG ATTTAGAGCA 1200 

ATACAATCAC TAAATTTACC TCAAGCAGAC GTTATTATTCGTGATATGGA T GAT G AA GAG 12 60 

ACGGCTGTTG TTGCATTAAT TGAGAATATT CAAAGAGAAA ATTTGTCTGT TGTTGAAGAA 1320 

GCGGAAGCCT ATAAGAAATT ATTGGAAATT GGTGATACAA CGCAAAGTGA ATTGGCAAAA 1380 

AGTTTAGGTA AAAGTCAAAG CTTTATTGCA AATAAGTTGC GTTTATTGAA GTTGGCGCCG 1440 

AAAGTACTAC TTCGCTTAAG AGAAGGTAAA ATTACTGAAC GTCATGCGAG AgcGGtATTA 1500 

TCATTGTCTG ATAGCGAACA AGAAGCGTTG ATTGAGCAAG TCATTGCACA AAAGCTAAAT 15 60 

GTGAAcAGAc TGAAGATAGA GTACGCCAAA AAACGGGGCC CGAAAAAGTC AAAGCACAAA 1620 

ACCTTCGCTT TGCACAAGAT GTCACTCAAG CACGAGATGA GGTAGGCAAA AGTATCCAAG 168 0 

CGATTCAACA AACAGGATTA CATGTTGAGC ATAAAGACAA AG AT CAT G AA GATTATTATG 174 0 

AAATAAAAAT T C GAAT AT AT AAACGTTaGT AGTAGGATGT CGTATACATG AT GAC T AAC A 1800 

CATAAAAGAC AAAGCTAAGA TCATAACAGC TTTGTCTTTT TTTTTTGTT TACGTGAAAC 18 60 

ATAAAAATTT ATATTTATAT GTTGATCAGG CTGGTACATA AATCAATGTT CTATGCTCTA 192 0 

CGAAGTTATA TTGGCAGTAG TTGACTGAAC GAAAATGCGC TTGTAACAAG CTTTTTTCAA 198 0 

TTCTAGTCAA CCTTGCTGGG GTGGGACGAC GAAATAAATT TTGCGAAAAT ATCATTTCTG 204 0 

TCCCACTCCC TAATTTGAGC TGGATATACT TTCATTTGAA CCCTTTATTG CTAGTTTATG 2100 

AAAGTATCAT GAAAGCTTTA TGAACATCGC TTGAGTTGCC TTTACAGTAG AAAATTTAAG 2160 

TTTTACACTT TGTGTGAATG ATACGTTTTG TATTGAATTA ATTATAGAAA GGTACGTTGA 222 0 

AGATGTTTTC AATTGGAAGT GCAATTCTTC ATTTTGTCAT TGGTGGTATC GCTGTTGCAT 2280 

TAGCTTCAAT TATTGCTGAT AAGGTAGGTG GTAAGTTAGG AGGTATTATA GCTACTATGC 2 34 0 

CGGCAGTCTT TCTTGCGGCT ATTATCGCAT TAGCTTTAGA TCATCGTGGT ACGCAAT TAG 24 00 

TGGAGATGTC GAT GAAT C T T AGTACTGGAG CAATTGTCGG TATTCTGTCT TGTASTTAA 2 4 60 

CTGTATTTTT GACATCTCTC T AC AT T AAGC ATAAAGGTTA TCGGAAAGGC GCAATATTCA 252 0 
CAGTTGTTTG TTGGTTTGTC ATTTCCCTCG CAATATTCAG TATTAGACAT TTATAGTTTG 2580 

GAAAATGCGT GATAATTAGT TGTATTCAGT TATTAAGTAA TAAATTATTG GAGGCAGAAC 2 64 0 

AT CAT GAAAT TAACATTAAT GAAATTTTTT GTGGGGGGAT TTGCAGTATT ATTAAGTTAT 27 00 

ATTGTATCTG TAACACTACC TTGGAAAGAA TTTGGCGGTA TATTTGCaAC GTTTCCGGCA 27 60 

GTATTTTTAG TGTCTATGTT TATTACAGGT ATGCAATATG GTGATAAAGT CGCTGTGCAT 2820 

GTAAGTCGTG GCGCAGTGTT TGGTATGACA GGGGTATAG TTTGTATTTT AGTTACATGG 288 0 

ATGATGTTAC ATATGACGCA CATGTGGTTG ATTAGCATTG TTGTTGGTTT CCTAAGCTGG 2 94 0 

TTCATCAGTG CAGTATGTAT TTTTGAAGCG GTAGAATTTA TAGCACAAAA AAGATTAGAA 3000 

AAGCATAGTT GGAAAGCTGG AAAATCGAAT AGTAAATAGT GTGAACGTAA TCTCTTAACT 30 60 

AGGACTAACT TTGCAAGCAT TGAATAGCAT GGAAAAGTTG CAT CAT T AAT AAGTGAAATT 3120 

CAAGTTGGCA TTGAGAAAAT TACAAGCGCG TAATCATACa GGTCTGTCTT AAGGGAGTCT 3180 

TCGAACCCCG ATGTTGTCGT ATGTCAAAAC ATTTAGTCAA TCATAAAGGT GACTTGATTT 324 0 

AACTTTATCT GATAGTCTG^ TTGTAATGAT TGTACTAATT GACTGGAGGC GTATGTAATT 3300 

GAATCTGAGT AAACAAATTA AAAAGTATAG G G AAC GAG AT GGTTATTCAC AAGAATATCT 33 60 

TGCTGAAAAG TTATATGTAT CTAGGCAGAG TATTTCTAAT TGGGAAAATG ACAAAAGCTT 34 2 0 

ACCAGACATA CATAACTTAT TAATGAyGTG TGAATTGTTC AATCEAACTT TAGATGATTT 34 8 0 

AGTAAAAGGG ACCATTCCAT TTGTACCTGA TATTAAAGCG CAACGAAGTC TTAACTTATG 354 0 

GACATATGTG ATGCTTATTT TCATGACATT AGCTGCAATT TTAATGGGAC CTTTAGTTGT 3600 

TTATTGGAAT TGGACTTGGG GTGTAACGGT GGCAATCATT TTGGGAATAG GTTTTTATGC 3 6 60 

ATCTATGAAA ATAGAAGATT TAAAAAAAGT GCATAAAATG GACAACTACG ATCGAATTGT 37 2 0 

TGCTTTTATG AATGGAAAAG ATCCTAGTGA AGTACAAACG ACyAAAGCTA GAAAT ACGAT 37 8 0 

GACAAATGCG CTTTCTATTA TATCAGTAAT TGGTATACTC AGCCTCATAA TTTTCCTTAG 384 0 

TGTGTATTTG GCAAATAAGT TTTTATAAAT CATCGTGGTA TCGTCTCATA TTATTTATAT 3900 

TATCCAAAAT AGCATAAAAA AATACCAACA AGATTTAGAA CCTTGTTGGT AATCAAAGCG 3960 

aTTCATTTAT AATGAGTCGT TTTATGTTGT AAGATTAAAC AGTTTGTACG TTAACTGCTT 4 02 0 

GGTCTCCACG TTGACCTTCA GTGATTTCGA AAGTAACTTT TTGACCTTCT (TTAAAGTTT 4 080 

TGTAGCCATC GCTAGCGATA CCTGAGAAAT GTACGAATAC GTCTCCGCCA TTTTCTTGTT 414 0 

CGATGAAACC AAAACCTTTT TCTGCrTTAA ACCATTTwAC TGTACCGTTA TTCATATwGA 4 2 00 

AwACCTCCGT gTGCTTTTGC ACTTAATATT TGTAACAAAT TCATAACTAA AAAAGAGGAT 4 2 60 

ATTCTAAACA AATACACTAC AATTTAATTC ACGAGCTTTT ATTACGTAAG ACCAACTATA 4 320 

CGCTCATATT GGCATAATGT ACAGTGTTTT TTGAAAATAA ATTAAAAAAG ATTTTTAAAA 4 38 0 
ACCTTAGAAA CGTTGATTTA AAGGGGTTTA TAAAAATwAw AAAATTGTAG TCTTTTATGG 4440

TGTTTGCTAG TTTTCAAAGT GACATATCGT TTffiACATGA TGATTTTATA AGCAATCCAT 4500

AAAAAACAAG CAGCGATAAA CGCTACTTGT TGATATTAAA ATCTGACTTG AAAGGTCATA 4560

GCAATGTTCT ATACCGATGG AATGTGCTTA CTTGCCTTTT TCTTCACGAC GTTTTAAATA 4620

ATAAGAGCCA CCTAATAAAC CAGCTGGAAT GCCTATCATT GGTGTTGTGA ATGAGCTm 4680

TACAATAACA AGTATTGTTA AAGCAATGAC GTTATACCAA GTTACAGTCA AATTTTTCAA 4740

ATCCTCATAT GATTGTTTTA CTAATTCTCT AAATTTCATG ATTCAATCTC TCCTTTTTTA 4800

TAAATCTTTA GATTGTCAAA TTAAGCTGGA CA 4832



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

CAAAGCTGTT CAAAAGGCTT ATAATTTAAA T T T AG AT AAC ATACGTACAA TGGAACCTAA 60 

GTTGAGATAT CAAGCGAT CA ATAAAGGTAA TATTAATTTA ATAGATGCAT ATT C AAC T G A 12 0 

CGCTGAATTA AAACAATATG ATATGGTTGT GTTAAAAGAT GATAAGCACG TATTTCCACC 180 

ATATCAAGGA GCACCATTAT TTAAAGAAAG CTTTTAAAG AAACATCCAG AAATTAAGAA 24 0 

ACCGTTAAAC AAACTAGAAA ACAAAATATC TGATGAAGAT ATGCAAATGA TGAACTATAA 300 

AG T AAC AG T T AAAAATGAAG ACCCATATAC AGTTGCGAAA GATTATTTAA AAGCAAAAGG 3 60 

GTTAATCAAA T AAC G AC C AA CGCCACATAA GATGCGTAAC ACCAAATTAT ATCTTATGTG 4 20 

GCGTTGTTAT ATTTAAATCT ATAATTATGT TCAATTTAAA CATGCAATAA TGATTAAAAA 4 80 

ATATGACATG TTAAACACAA TGTAAGCTAT TATGATGTGA AAATAGTAGC ATTGCATTTT 54 0 

AGAAACATAG AGCGATATAA TGAATATAAG TTTTTTGAAA TTTCAGTTAA TTCTAAGGAG 600 

GTTGTTTTTA TTATGAAPGA ACAACTTAAT CAACTATCAG CATATCAGCC TGGTTTATCT 6 60 

CCAAGGgCAT TGAAAGAAAA GTATGGCATT GAAGGAGATT TATATAAACT TGCATCAAAT 720 

GAAAATTTGT ATGGACCATC GCCTAAAGTT AAAGAAGCGA TATCAGCACA CTTAGATGAG 7 80 

T TAT ATT ATT ATCCTGAAAC AGGATCACCG AC AT T AAAAG CGGGATTAG TAAACATTTA 840 

AATGTAGATC AATCACGCAT TTTATTTGGT GCGGGATTAG ATGAAGTTAT ATTAATGATT 900 

TCTAGAGCTG TATTAACGCC AGGGGATACT AT T G T T AC AA GTGAAGCGAC ATTCGGTCAA 960 
TATTATCACA ATGCGATTGT TGAATCAGCT AATGTGATAC AAGTACCTTT AAAAGATGGT 1020

GGCTTCGATT TAGAAGGTAT TTTAAAAGAA GTTAATGAAG ATACGTCATT GGTATGGTTA 108 0 

TGTAATCCAA ATAATCCTAC AGGTACATAT TTTAATCATG AGAGCTTAGA TTCGTTTTTA 114 0 

TCTCAAG TAC CTCCACATGT ACCAGTAATT ATAGATGAAG CTTATTTTGA ATTTGTGACA 1200 

GCAGAGGACT ACCCGGATAC ACTTGCTTTG CAACAAAAAT ATGACAATGC TTTCTTATTA 12 60 

CGTACATTTT CAAAGGCGTA TGGATTAGCG GGTTTACGTG TAGGATATGT GGTAGCAAGT 1320 

GAACATGCGA TTGAAAAATG GAACATCATT AGACCACCAT TTAATGTGAC ACGTATATCT 138 0 

GAATACGCAG CAGTTGCAGC ACTTGAAGAT CAACAATATT TAAAAGAGGTAACACATAAA 14 4 0 

AATAGTGTTG AACGCGAAAG ATTTTATCAA TTACCTCAAA GTGAGTATTT CTTGCCAAGT 1500 

CAAACGAATT TTATATTTGT AAAAACmAAG CGGGTAAATG AACTTTATGA AGCACTTTTA 15 60 

AATGTAGGGT GTATTACGCG ACCATTTCCA ACTGGTGTTA GAATTACAAT TGGTTTTAAA 162 0 

GAACAAAATG ATAAAATGTT AGAAGTTTTA TCAAACTTTA AATACGAATA GTAAGTGGGG 1680 

AGTGGGACAG AAATGATATT TTCGCAAAAT TTATTTCGtC GTCCCACCCC AACTTGcATT 17 4 0 

GTCTGTAGAA ATTGGGAATC CAATTTCtCT TTGTTGGGGC CCCGCCGGCA AGGTTGACTA 1800 

GAATTGAAAA AAGCTTGTTA CAAGCGCATT TT:GTTCAGT CAACTACTGC CAATATAACT 18 60 

TTGTAGAGCA TTGAACATTG ATTTATGTCT CAAGCTCAAT GCAGTGTGAA TGATGAGGTG 192 0 

AGAGTATTCA GTGTAAAAAG CAACAATAGA TGATATTGTT TTGTATCAAT TGCTTTTTTG 198 0 

CTATACTGAA TCAATACTGA TATTTTCAGG AGAAGATTAA AATGACCCGT AAATCAAOG 2 04 0 

CGATTGATAT GGATGAAGTA TTGGCAGATA CATTAGGAGA AATCATTGAT GCTGTCAATT 2100 

TTAGAGCGGA TTTAGGTATT AAAATGGAAG CTTTGAATGG TCAAAAACTT AAACATGTTA 2160 

TTCCTGAACA T GAT G GAT T A AT T ACAGAAG TATTGAGAGA ACCAGGCTTC TTCAGACATC 222 0 

TTAAAGTGAT GCCGTATGCA CAAGAAGTTG TGAAAAAATT AACTGAACAT TATGATGTAT 22 8 0 

ATATTGCTAC AGCAGCAATG GATGTACCAA CATCATTTAG TGATAAATAT GAATGGTTAC 2 34 0 

TAGAGTTCTT TCCATTTTTA GATCCTCAGC ATTTTGTTTT TTGTGGTAGA AAAAAC AT C G 24 00 

TTAAAGCTGA TTATTTAATA GATGACAATC CTAGACAGCT TGAAATTTTT ACTGGTACAC 24 60 

CGATTATGTT TACAGCAGTG CATAATATTA ATGATGATCG ATTTGAACGC GTAAATAGCT 2520 

GGAAAGATGT AGAACAGTAT T T T T TAG AT A ATATTGAGAA ATAAAATATA TCACTTGAAA 2 58 0 

AATTTCATGT AGAAAAGATG ATGGATAGGC TATAAAGTAA TTGTGACTGA GATGAACTTT 2640

TATGTCTTAG ACACTACAAC ACTATATTGG C AG TAG T T GA CTGCGGGGCC C C AAC AT AG A 27 00 

GAAATTGGAT TCCCAATTTC TACAGACAAT GCAAGTTGGG GTGGsCCCCA ACATAAAGAA 2 7 60 

ATACTTTTTC TTTAGAAATT AGTATTTCTT AT G CAT GAG T GTAACTCATG CATTCATATT 2 82 0 
TTTAAGTACA CATTAGCTGT G^CTAATGAT AAAGAATCGC TACATAATCA ATCATTAGTC 2880

GTTCTTTATC ATTTCCGTCC CGCTCTCAAT AAATGTTAGT CTATCTTATT ATTATAAATC 2 94 0 

GGATGAATGT GTTAATCTAT GGCAGATTAC ACGTCATCCG ATTTTTTATA GAATTTGAAA 3000 

AAGACGCATA AACCACTATG AT T T AAAAT A CAACATCAAT CATTTTSTG gCATGCGCCA 3 0 60 

AAATTATATG TCTGTTTTTG AAACAGGGTA ATAGCTTAAA GCTAATAAAA ACGAATATAA 3120 

GGTGCGTTGA ATCTTATGAT TACACTCCAA ACCTAATATA ATATCGGGTT AAGATCATTC 3180 

CGGATGCTTA C AAAT CAT T G ACAGTAAGTA ACTGAATGGC ATTTGGTATA ACCTCAATAT 32 4 0 

CAATAGGTGT TTCTAATGAA ATTTCGCCAT CAATATCAAC TTTCATTGCT GGATCTGTTG 3300 

TAAGTGAAAT CTTTTTACCA GGTATATGCT CAATACCTTG AGTAATTTCA TTCCaATTCA 3360 

TGCTATCACG CTTTTTAAAA ATATCATTTA AAAT AC T G AA ACTTTGTTCA TTAAAAATGA 3420 

AAGTGTTCAG TTCACCATCT TGAGGAGAG\ AATCAGTCaA TGGTATACGA CTACCACCAA 34 8 0 

TGAATGGACC ATTTGCTGTT AGTATCATGG TCGTTTCGCC AGAATATGTC TTATCATCTA 3540 

TTGATAATTG AT AAT T AAAT TGTGTTGGAT TTAGCAGTGT TTTGACAGTT GATCCAATAT 3600 

AACTCAATTT ACCAAATATA TCTTTTGAAC CATCTTGTAC GTTTTCAGCG TTTGAACAA 3 6 60 

TGAGACCTAA GCCAACAAAG TTGAGTGCAT ATTGATTATT TATTTTAATT ACATCGTATG 3720 

TACCAACTTG TGCAGAAATC ATTTGTTCAC TAGCTTGTTT ATGATTAGGT GCTATATTTA 37 8 0 

GCGTTTTTGT AAAATCATTA AAAGTACCGC CTGGT AAAAT GCCAATAGGG AGTTGAAGGT 3840 

CATGTGTCAT AACACCGTTT ATAAGTTCGT TAACCGTGCC ATCACCGCCA AGAATAAATA 3900 

ATATATCTAC ATCTTTTGCA TAGTTTTTAG TTTTGATTTC TTGGCAATAT TTAATAATGT 3960 

CACCTTCGTT TTCACTCAAT T G AAT AG AAA GATGCTTACA AATTGAACTT AATGCTGTTG 4 020 

TAACTTCCCC AATACCTTGA TTAATATTTT TTAATCACT GTGTTCATGG TAAAAGAGGA 4 08 0 

CACCATGTGT ATATTTATTT TCCATAGTTT AGCCTACTTT CTAAAAATTG GTTCATTAAA 414 0 

TATATATACC CACTTTTAAT TGTTAATACC AAAAATATGT TTTTAAATAG AGAAAATGGT 4 2 00 

AATAAATGAA ATTGATTTCT ATAGAGTGGG AC GAG AAAAT ATAGT T AT AG CTGTCTATAA 4 2 60 

TGAGCATATT AAGTTTTTAT TTATACTGAT ATCTTGAATT TAATTAATAG AAACCTATAA 4 320 

AAAAACAGTA AGCCATTTAA ATGACTTACT GTTTTTTGAA TTAGGCCAAC AATATTAACG 4 38 0 

TATACCTTTC ATCGCTTTGA TGATTAAAGG TGAGAATGCT AATACAATTG TTGTAACAAT 4440 

AAT TGCAACA ACACCTAG3A AAATAAAGTA ATTTGTTTGA CCTAGTGGTT CTATTAACTT 4 500 

AACTAAAGTA CCATTGATTG CTTGTGCAGA AGCGTTAGTT AAGTACCAAA TACTCATCAT 4 5 60 

TTGGGCATTA AATGCTTTAG GTGCTAACTT AACAGCAGCA CTATTACCCG TTGGTGATAA 4 620 
GCATAGCTCA CCGATAACAC AAATAATGTA CGATAAAATA ACCAGTTAA CTGAAAAGTT 4680

TGATGAACCT GATGCATAAC CTACAATACC AATTAGTATG TATGACGCAC CTGCTAAGAA 47 4 0 

CGTACCAATT GCAAATTTTA CTGGCAGGCT AGGTTGTTTA GTTCCAAGCT TTTGCCATAA 4 8 00 

AAGTGAAATA ATTGGAGCTA GTAATAAAAT AAATAATGGG TTAATTGATT GGAAGATCGC 4 8 60 

TTCACCAAAG TTTGTTTTCC AACCAAATAA GTTTAATTTC ATATCTGAAT GTTCAATTCC 4 920 

ATATATGTTT AATACATTAG ACCCTTGTTC TTGAATAGCC CAGAACACCA TTCCAAGAAT 4 980 

AAATAATGGA ATAAATGCTT TAACACGAGA ACGTTCAGTA TCAGTGACAT CTTTACTTCT 50 4 0 

AATAATTAAA GTGAAGTAAA TGAnEGTM TGCAATACCT AATACTAAAA CAGTATTACT 5100 

AACTAAGTTA AATGATAATG AGTTAGTTAA TGCACCAATA ACGATAATTA ATACAATTGC 5160 

TAAAACAACA CTTCCGATAA TAAGACCATA CTTTTTCTTT TCAGCTGGTG TCAATGGGTT 52 2 0 

AGTAGGTTTC ATACCAACGC TACCTAAGTT TTTGCGGTTG AAAAG T AC ATACC AT AC T AA 528 0 

ACCTAATGCC ATACCAACTG CTGCAATCAA GAATCCGCCG TGGAAGTTTT TAACATTAAC 5340 

AAAGTGTTGC AAAATAATAG GTGATAATAA TGCACCCATA TTAACTGACA TATAGAAAAT 54 00 

AACAAAACCT GCATCCATAC GTCTATCATT TTCAGGATAT AAACGGCCAA CGATATTTGA 54 60 

AATGTTTGGC TTCATTAAAC CTGAACCAAT AATGATGAAG AACATTGATG TGAATAAGCC 5520 

GATTAATGCA AATGGTAAGC TTAAACAAAT ATGTCCGATA ATAATAAAGA CTGCACCTAA 5580 

TAAAGTAGCG CCTCTAGTGC CTGTAATTCT GTCAGCAATC CATCCGCCTG GTATTGATGT 5 64 0 

CATATAGATT AAT GAACCAT ArACTGACAT AATGACATA GCTGTTGTTT TATCAATTCC 5700 

AAGGCCATTA TCTGTTACGG CAAGTAC 57 27 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14078 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGGACTATTA ACGGCGaAGA AGATTTAACG AAATACTTAC AAACCAATGT TGATGGTATT 60 

ATCACAGATG ACCCAGCATT AGCTGATCAG AT T AAAGAAG AAAAGAAAGA CfiAACATAC 12 0 

TTCGATCGTT CTATAAGAAT TTTGTTTGAA TAATATAAAC AAAGACCTCT AAAGTTATCA 180 

AG AT GAT AC C TTCAGAGGTC TTTTTAATGT TGCCATCTAT GGGATAGGCA ATCGTTTCAT 240 

TCGTTTATAT T CAT AT G AC A AGTATTTGTA TGGCAATTTG GCGTCACAAA C AC T T AC AT G 300 

ATTTATTGGT GAATTATTAA TTGTTTTGTG AATGCAAAGG GTTAGAAATT GAATTGTAAA 3 60 
TACTTTCTAA TCTTTGTTTC GCTTTAGTCA TTTGATCCAA ATTTTTAGTG CGTATAGCGG 420

ATTTTGCAAT ATAGTGCGCA cTAAAATATC GCGTTTTTGA AACGCATCTA AATTTAGGTA 480 

CGATAATTTA TTTAAGTCAG TGTTTGCTAT TAATCATGT AATTGATCTA CAAGCGCTTG 54 0 

ATGTTGATAC GTATGTGATG TAGTTTCAGA TTTGCTTGCT AATTTAATAC CAGTCGTATC 600 

AAGGAGCGCC GCTTTAATAC CAGCAACTAA ATATGTTTTG ATTTTCATTT GTGTTGTCAT 6 60 

GCTTTGTTAC TCCTTTGATG TACATTAATC AAAAAAATTA TACACTATTG TATATTGCAA 720 

AGCTAATTAA C T AT AAC AAA AAGATAGTTA ATGCTTTGTT TATTCTAGTT AATATATAGT 7 80 

TAATGTCTTT TAATATTTTG TTTCTTTAAT GTAGATTGGG CAATTACATT TTGGAGGAAT 84 0 

TAAAAAATTA T GAAAAAGC A AATAATTTCG CTAGGCGCAT TAGCAGTTGC ATCTAGCTTA 900 

TTTACATGGG ATAACSAAGC AGATGCGATA GTAACAAAGG ATTATAGTGG GAAATCACAA 960 

GTTAATGCTG GGAGTAAAAA TGGGACATTA ATAGATAGCA GATATTTAAA TTCAGCTCTA 102 0 

TATTATTTGG AAGACTATAT AATTTATGCT ATAGGATTAA CTAATAAATA TGAATATGGA 1080 

GATAATATTT ATAAAGAAGC TAAAGATAGG TTGTTGGAAA SGTATTAAG GGAAGATCAA 114 0 

TATCTTTTGG AGAGAAAGAA ATCTCAATAT G AAG AT TATA AACAATGGTA TGCAAATTAT 1200 

AAAAAAGAAA ATCCTCGTAC AG AT T T AAAA ATGGCTAATT TTCATAAATA TAATTTAGAA 12 60 

GAACTTTCGA TGAAAGAATA CAATGAACTA CAGGATGCAT TAAAGAGAGC ACTGGATGAT 1320

TTTCACAGAG AAGTTAAAGA TATTAAGGAT AAGAATTCAG ACTTGAAAAC TTTTAATGCA 138 0 

GCAGAAGAAG ATAAAGCAAC TAAGGAAGTA TACGATCTCG TATCTGAAAT TGATACATTA 14 4 0 

GTTGTATCAT ATTATGGTGA TAAGGATTAT GGGGAGCACG CGAAAGAGTT ACGAGCAAAA 1500 

CTGGACTTAA TCCTTGGAGA TACAGACAAT C C AC AT AAAA TTACAAATGA ACGTATTAAA 15 60 

AAAGAAATGA TTGATGACTT AAATTCAATT ATTGATGATT TCTTTATGGA AACTAAACAA 1620 

AATAGACCGA AATCTATAAC GAAATATAAT CCTACAACAC ATAACTATAA AACAAATAGT 1680 

GATAATAAAC CTAATTTTGA TAAATTAGTT GAAGAAACGA AAAAAGCAE TAAAGAAGCA 17 4 0 

GAT GATTCTT GGAAAAAGAA AACTGTCAAA AAATACGGAG AAACTGAAAC AAAATCGCCA 1800 

GTAGTAAAAG AAGAGAAGAA AG T T G AAG AA CCTCAAGCAC CTAAAGTTGA T AAC C AAC AA 18 60 

GAGGTTAAAA CTACGGCTGG TAAAGCTGAA GAAACAACAC AACCAGTTGC ACAACCATTA 192 0 

GTTAAAATTC CACAGGGCAC AATTACAGGT GAAATTGTAA AAGGTCCGGA ATATCCAACG 1980 

ATGGAAAATA AAACGGTACA AGGTGAAATC GTTCAAGGTC CCGATTTTCT AACAATGGAA 2 04 0 

CAAAGCGGCC CATCATTAAG CAATAATTAT ACAAACCCAC CGTTAACGAA CCCTATTTTA 2100 

GAAGGTCTTG AAGGTAGCTC ATCTAAACTT GAAATAAAAC CACAAGGTAC TGAaTCAACG 2160 
TTAAAAGGTA CTCAAGGAGA ATCAAGTGAT ATTGAAGTTA AACCTCAAGC AACTGAAACA 2220

ACAGAAGCTT CTCAATATGG TCCGAGACCG CAATTTAACA AAACACCTAA ATATGTTAAA 2280 

T AT AG AG AT G CTGGTACAGG TATCCGTGAA TACAACGATG GAACATTTGG ATATG&GCG 2 34 0 

AGACCAAGAT TCAATAAGCC ATCAGAAACA AAT GC AT AT A ACGTAACAAC ACATGCAAAT 2 4 00 

GGTCAAGTAT CATACGGAGC TCGTCCGACA TACAAGAAGC CAAGCGAAAC GAATGCATAC 24 60 

AATGTAACAA CACATGCAAA CGGCCAAGTA TCATACGGAG CTCGTCCGAC ACAAAACAAG 2520 

CCAAGCAAAA CAAACGCATA TAACGTAACA ACACAT GGAA ACGGCCAAGT ATCATATGGC 2 58 0 

GCTCGCCCAA CACAAAACAA GCCAAGCAAA ACAAATGCAT ACAACGTAAC AAC AC AT G C A 2 64 0 

AACGGTCAAG TGTCATACGG AGCTCGCCCG ACATACAAGA AGCCAAGTAA AACAAATGCA 27 00 

TACAATGTAA CAACACATGC AGATGGTACT GCGACATAG GGCCTAGAGT AACAAAATAA 27 60 

GTTTGTAACT CTATCCAAAG ACATACAGTC AATACAAAAC ATTACGTATC TTTACAACAG 2820 

TAATCATGCA TTCTATGATG CTTCTAACTG AAT TAAAGCA TCGAACAATC GGAAGCATAT 28 8 0 

TTCTAAATTA TTTATTCATT ATAGTCTTAA ACATAACATG ACCTAATATA TTACTAACCT 2 94 0 

ATTAAAATAA ACCACGCACA TCTAAGTGAT AT AC G AC AAT CACAGCAATA ATAATTGCTT 30 00 

TAGAAAGTCG TGCCGAACTG GAACTTACAA GTCTAGTTCG AACACACACT GATGTGAGTG 30 60 

GTTTTCTTTA TTTTAAACAT G AAC AAT C AG ATAAGTTACT AGCATTAGCA AAT AT TAT T A 3120 

AATCAAAGGG CTTCGATTCA TAAAATTTAA AACAATGATT AAAAT TAG AC GTGTAAATGT 318 0 

TAAATTCTAA AACGGAAATA ACCACCATCC CATTAAACCA CTTTTTTTGT TCAATCACTA 32 4 0 

TATTTCACAC AGCTTCATTA ATAAAACGAA ATTGCTTCAA CCCGCTTCAA CTTCAACTGG 3300 

CTTCAACTTC AGCCTACTTC ATTCAATAAC AAAACGAATC CGCTCATCC AAAATCAACC 3360 

ATTCTAACGC ACATAT TCAA ATATAGCAGC TGCACCCATG CCGACACCAA TACACATCGT 34 20 

AACCATGCCG TAACGGCTAT CGGGACGTCT ACCCATTTCA TTAAGTAAAC GCGCGGTTAA 34 80 

CATTGCGCCT GTAGCACCTA ATGGATGACC TAAAGCAATA GCGCCACCAT TCACATTCGT 354 0 

ACGTGATATA TCTAGACCTA CTTCTTTAAT AGATGCAATC GTTTGAGAAG CAAATGCTTC 3 600 

GTTCAATTCG ATCAAATCAA TGTCTTCAAC AGATAGATTG CTGAGTGACA ATACTTCAGG 3660 

AATCGCATAT GCAGGCCCAA TACCCATAAT TTTCGGGTCA ACGCCTACTG CCTTAAAACC 37 20 

AACGAATCGT GCAATAGGTG TCACGCCEAG TTCTTTCACT TTATCTCCAG ACATTAAAAC 37 8 0 

TACAAATCCT GCACCATCAG AAAGTGGGGC AGATGTTCCT GCAGTCATAG TGCCGTCAGC 38 4 0 

TTTAAATACT GTACGTAATT TGGCTAATGC CTCCATCGTG GTGTCAGGGC GTATAAATTC 3900 

ATCTTGGTCA AAGATATTTG TGTGTACTTT TGGTCCTGCG TTTGTATATT CACTGAGTT 3960 

TACTTGTATT GGAATAATTT CATCTTTGAA CCGACCATCA CGTTGTGCGT CATAGGCACG 4 02 0 
TTGATGACTT CTGACAGCAT AAGCATCTTG ATCTTCGCGT GATACGTCAA ATTGGGATGC 4080

TACATTTTCA GCAGTTAAAC CCATAGGATA TGACGCACCT AT AT CAT CAT ATTGTAAGGT 414 0 

TGGATTGTTT GTGGGCTCGT TGCCACCCAT TGGTACGGCA CTCATCAATT CAACGCCACC 4 200 

AGCTACAAGT ATATCTCCTT GACCAGCCAT AATTTGATTG GCTGCAATCG CGATGGTTTG 4 2 60 

TAATCCTGAT GAGCAGTAGC GATTCACTGT TTGACCCGGT ACCGTGTCAG ATAATCCCGC 4 320 

ACGCAATGCA ATCGTTCGTG CAATGTTTTG GCCTGTAAT CCTTCTGGAA AAGCCGTACC 4 38 0 

AACAATGACA TCTTCAATCA TATTCTTATT GAATTTTCCG TCAATACGTT TCAATACGCC 4440 

TTGTAATACT TTGGCTGCGA CATCATCAGG TCTTTCGTGG AATAATGCGC CTTGCTTTGC 4 500 

TTTCGCTGCG GCTGAACGCC CATAAGCTAC AATGTATGCT TCTTGCATGG TTATCATCCT 4 5 60 

CTCTTAATGA CTATCTTTTA ATTACGTAAT GGCTTACCAG TTTTTAACAT ATGTGCAATT 4 62 0 

CTTTCATATG ATTTTTTAGA TTTTAGTAAG TCAATAAAGC CAATTTTCTC CAACGATTGA 4 68 0 

ATGTAACGTT GATTGATAAA TGTATTTCTT GGTAAATCAC CACCCGCTAA AATTGTGGCG 47 4 0 

ATATTTAAGG CAATATGATA ATCATGGTCG CTAATAAAAT GACCCCGTCT TTGCGCATCT 4 8 00 

AATTGTCCTT GGATCAATGC TTTGAAGTCT TCACCTAAAG CGATATATTG ATGTCTAGGA 4 8 60 

TTCGGAATAT AGTTTGTTTC TGCTTCATAT TTCGCACGTT TGAGCGCAAC TTCGACACGT 4 92 0 

TGTGCTGTAT TGAAAATAAT CGTATCTGTA TCACGTAAAT ACCATAACG ACGTGCCTCA 4 98 0 

AAGGCATTTG TAG AG AC T T T CGCAAATGCG ATATTCGTCA GTACTTTTGT CATGGAAGCT 504 0 

TGTTTGTCAT CAAACTTATG CGATGTGCGT AATATGCGAT CAGCCATTTC TGCAAGGCCA 5100 

CCGCCACTCG GTAATAAGCC AACACCTGCT TCAACAAGAC CGATATATGT TTCACTTGCA 5160

GCGACAACAA TAGGTGAGTA AAGTACAAGC TCACAGCCAC CGCCTAAGGC ACGACCTTGA 522 0 

ACAGCTGTGA CTACTGGTTT CAAACTATAC TTCAAACGAT TAAAGCTATA ATGTAATTTA 52 8 0 

TCAATTGATT GTGCAACGAC ATCATCTACA AGACCGTCTT CATGCGCCTT TTTCATTAAG 534 0 

AAAAGGTTAG CACCCACACT GAAVTTGTTA CCATCTGCAT AAATAACCAT ACTTGTGTAA 54 0 0 

TGGTCATTTT CCAGTAAATC AATCGCATCA ACTAACGCAT CGTTGAATTC ATCGGTAATG 54 60 

ACATTATTTT TACTTTGTAA TTTCAGTAAC AGTTGATCAT CATGAGTTAC GGAAAGTTTG 5520 

GCATCACCTT TATCCCAAAG TTCATCTTTT AC G AAG T GAG AAATAGGTC TGCATATTCA 5580 

ATGGTCTCAT CTTGTTTATA AAAGCCACCA TCTAAATCAC TAATCCATTG TGGTAAGTCT 5640 

CCAAGTTCGT CTTCCATACG TGTTTTAACA CGTTCGTATC CCATTGCATC CCATAATTGG 57 00 

AATGGACCAA GTTTCCAGTT GAACCCCCAG ACAAGCGCAC GGTCTATGTC TCGGAAATCA 57 60 

TCGGTAGCTT TAGGTACATT GATAGCAGAG TAATAGAAAT TATTACGTAA TGTCTCCCAT 5820 
AAAAATAGTC CCGCTTCGTC TTGCGCATTG AATATGGTAT CAAGGTTATG CACTAAGTCT 5880

TTATTAAATT CATTTAAAAT TGGTAATTGT GGTTGCGATA CAGGTACATA ATCTTGTTTT 5 94 0 

TCAACATCGT AAACAAGTCG AGCTTTAGTT TCTTTATCCT TTTTGTAAAA TCCTTGTTTC 6000 

GTTTTACGTC CGAGTGCGCC ATTGTCAAAC AACGTATTTA CAATTTTGAC AT CAT GAAAA 60 60 

TAAGGTGTTT CTTCAGGTAC TTGTTGCATG CCTTTAATTA CAGACACTGC AAT AT C T AAA 6120 

CCGACTAGGT CAGATAGCGC ATATGTACCT GTTTTAGGAC GACCAATCGC TTGCCSGTT 618 0 

AAAGCATCCA CAT CT AC AAT GCTTATCTTG TGTTGCTCGG CGCGATACAT AAT AT CAT TC 62 4 0 

ATTGTTTGCG TGCCGACTCT ATTTGCGACA AAGCCAGGCA CATCATTGAC G AC AAT G AC A 6300 

CCTTTACCTA ACACATTTTG CGCGAAATTT TTTACATCTA ATATAATAGA TTCCTTCGTG 6360 

TGTGACGTAG GTATTAACTC C AC T AAT T T C ATAATACGTG GTGGGTTAAA GAAATGTAGA 64 20 

CCAAAGAATC GTTCTTGATC CTTCTCGTTA AATGCTTGAG CAATCGCATT AATTGGAATA 6480 

CCTGATGTAT TTGTAGCGAA T AAAGC AT C T TCTTTAGCAT GTTGTAGAAC TTGTTGCCAA 654 0 

ACAGCATGCT TAATTTCAAT ATCTTCTTTG ACTGCTTGA T AT AT AAAT C AGCATCATCA 6600 

TTTACCAAGT CAT CAT C AAA ATTACCATAT GTTAAATGAC TCGCTAGATT TAAGTCGAAT 6660 

AGTAGCGGCC GTTTCTTATC TGTAATTTTA TCGTAAGATT TTTTCGCAAT GAGATTTGGA 6720 

TCGTTTTTGT C C AC T AC AAT ATCTAATAGT TTTACTTTAA GTCCAGCATT CACAAAAAGT 67 8 0 

GCTGCCAGTT GAGCGCCCAT TGTGCCTGCG CCAAGAACGG TTACTTTATT AATTGTCATA 68 4 0 

GTGATTCCTC CAATTTAGTT GAGGATAAGA TAACCATTAA GATAATTGGA ATAACGTTGC 6900 

TATTTTATAA AAT T AAT T AA GTATCTTTGA CAGTCATCTT AGCCTCTTAT TTAAGGAAAA 6960 

AGCTTTATGC TTAAAATAAG TCTTTTTTAG TGAAATTAAT GCATCTCATA TAATTATTTG 7 02 0 

CTATTTATAC GAAAGCAGAA TCTCCAGTCA AAGCGCGTCC AATTACTAAG GCATTAATTT 7 080 

CATGTGTACC TTCGTACGTG TAAATCGCTT CTGCATCAGA GAAGAAACGT GCAATATCAT 714 0 

AATCGTCAGC TAGTATGCCA TTACCACCTG TAATACCGCG GCCCffiAGCT ACTGTCTCAC 7 20 0 

GCAAACGTAA GGCATTCATC ATCTTCGCCG TTGAAGTTGC AACCTCGTCA TATTCACCAT 72 60 

GTGCTTGCAT ATTAGCTAAT TGAGCACATG TTGCCATTGC TTGAGCTAAA TTACCTTGCA 7 32 0 

TCATTGCTAG CTTTTCTTGT ATTAACTGAT ATT T AC T AAT TGGTTTGCCG AATTGcTTAC 7380 

GCTCAGTGAC ATAATCTAAT GTGGCACGTA AAGCGCCAGC CATACCACCT GTAGCCATAT 74 4 0 

AAGCAACGCC TGCTCTCGTT GAATAAAGAA TTTTGGCAAT ATCTTTAAAG CTTGTTATGT 7 500 

TTTGTAAGCG ATCCGCTTCA TCTACTTTGA CATTAGTTAA TTTAATTAGG GCGTTAGGAA 7 5 60 

CAATGCGAAG TGCGATTTTA TTATCASTGA CTTCAATATC GACGCCATCT TGTTCTGGTC 7 62 0 

TGACTACAAA GCAATGGGGT TTGCCAGTTT CTTTATTTAC TGCGAATACT GGAATGACAT 7 68 0 
CAGATACATG TGCACCACCA ATCCATTTCT TTTCACCATT GATAACCCAA GTATCGCCTT 7740

GGCGTTCAGC GACTGTTTCA AGACCTCCCG CAACGTCCGA ACCGTGTTCT GETCAGTTA 7800 

AAGCAAAGCA TGTACGCAGT TCATGTGACT GTAATTTAGG T AC AT AT T T C GCAATTTGTT 7 8 60 

CTTTGCTACC TCCGAAATAG AAAGTGTTAT GCCCTAAACC TTGGTGAACA CCGAGTAGGG 7 920 

TAGCTAAGGA AATATCAAAT CGCGCGAGTA GGTAAGACAT GAAAAACTGA AATAGTTGAC 7 980 

TAGGCATTTT GGCGTTTGGA CGATCCTTGT AAAGTAATGG ATTGTTAAAA TAATTTAATT 8 04 0 

CTCCCAGATC TTTAAAATAG TCCTCGGGTA CAGTAGCGTC TATCCAATGT TGATTAATAT 8100 

TTTCACGGTA CTTACTTTCT AGCAATGAAT CTACTTGTTG TAAAAATTCG ACTTCACCGT 8160 

CTGTTAAACC TTTAGCAATA CTAAGTACAT CTT(AGGAAA TAATGTTTTT AAGACCGTTT 8220 

CTTTTTCAAA TGTCATATAA ATTCCTCCTA AAAATAATAT GAATACTAAT GTGAAATGCA 8280 

TTTAATTCAA AAACAACACG CTTTATTTGT AAACGCTTAC ACTAAATGTC AAAAATTTTT 8 34 0 

ATCACCTTTA AAGTGTTTGC GAGACTTTGT CATTCATCAT TTGTCGAATC GCAAGTTTAT 8 4 00 

CTGGTTTCTG CGTACTGTTT AACGGCATAT GTGTCACTGG TACATACATT CTTGGGACTT 8 4 60 

TATAACCTGC TAAACGACTt CGCATATGTT GATTTAAAAT TTCAGCGTAA TGAGGTTCAT 8520 

CTTCGCGAAG TATAATGGCT GCAGCAATTG ATTCACCATA TTTTGGATGA TCATAGCCAA 8580 

CGACCACACA CCGGTCTACT AGTGGATGCT CAGCTAAAGC ATTTTCGACT TCGGATGGTA 8 64 0 

AGACATTTTC GCCACCAGTT ATGATTAATT CTTTTTTGCG GTCAATAATA AATATATCGC 8 7 00 

CATCGTTGTC CATCTTCGCT AAGTCACCAG TTAATAAATA TCGACCATGA AATGCTTTGG 87 60 

CAGTCTCTGC TGGTTTATTC CAATATCCTG GCGTGACATT TTAGCCTTA ATTGCAAGTT 8820 

CGCCAATCTC ACCAGTAGGT ACTTCCTCAC CGTTATCATC AAGGATACGT GCATCAACGA 8880 

ACATGACTGC TTTACCAATA CTCATTGGCT TACGTTTTGA ATTTTCCGGT GT AT TAACAA 8 94 0 

GTACAAGAGG TGCTTCAGTT AAACCATAGC CGTTAATAAT GTTTATGCCA TATTGTTTAA 9000

AAGCTGCTTG GATACTTGGT AATGGTTGTG AACCACCTTG GAT G AT AT AA TCCATAGCTC 90 60 

TAAAATTTTC AGGATTAAAA TTACTAGCAC GTAGCGTACT ATAATACATT GTCGGAATCA 9120 

TGATAATAAA TGTAGGGTGA TATTGTGCAA TCATGTCATT CAATTCTTCG CCGTTAAAGT 9180 

AACGTTGAAG AATAAGTGTG CCKXTGACA TTAATACTGG TAATACAGTA TCGTTAAACC 924 0 

CTAAAACATG GAACATTGGT GTTGATACAA TCGTAATATA GTTTGAATTG AACTTATACG 9300 

TCAGCTCTAA GTTTGCACCG TTATGAACAA ATGATTCATA T G AG AAC AT C ACACCTTTAG 9360 

GTGATCCGGT TGTACCACTT GTATAAATTA ATGCTGCAAG ATCTTGTGE TCAACAGGTG 94 20 

TTGCTTGAAA AGGTTGGTGA TAATCTGGAT TTACGATTTC ATCATATTGC GCCACATCAA 94 80 
TATCCATATG CAATAAGTTT TGGTCAATAT CGGTGAGTGA ACTTAAATGT TTTTCAGCAT 9540

AGAAGAGCAG TTTTAATTGT GCATCTTCCA CAATGGCTGC AATTTCTTTT GGGTTAAGCC 9600 

GCCAATTCAA TGGTAAAAAA ACCGCACCTG TTTTAAAACA AGCAAACAAT AAATCTAATA 9660 

TTGCAATATC ATTTGGCGCA AAAATACCGA TAACATCGCC TTTTTTAACA CCTTGAGATG 97 2 0 

TTAAATAATG TGCCATATTA TCAGCGCGTG CATTGAGTTG TTGGTATGTC CAAGATGTTT 97 8 0 

GTTTTGCGTG ATCAATAACG GCAGGCTTGT CATCATCGAA GTCTGAACGC GTTTTTATCC 9840 

AATCGAAATT CATTAGTATA CCCCCTTTAG CTTCACTTTC ATACTTTATG AATTGATTGT 9900 

TTAAGTTGTC CCCATTTTTC TTTGTAAATG CTGGTATCAA TTAATTTTAA ATGATCAGCA 99 60 

ATAATTGGTT TAAAAGCCAT TTGATTCAAA ATATCTTTAT GCAAATCAAG ACCTGGGCA 10020 

ATTTCAATTA GTTTCAAGCC TTGATTGGTG AGTTCGAATA CTGCACGATC AGTAACAAAA 10080 

TAGATTTCTT GCTCGAGTGA TTGTGAATAT TGTGCATTAA AGTCGATATG GCTCACATCT 1014 0 

GATACAAATT TCTGGTTTTG TCCTTCAGTT TCAATGTTTA ATCGTTGATT ATGGCATGAG 10200 

ACATGACTGC CAGCTACAAA AGTACCTGAA AAGATAATTT TATTTACAGA TTGCGTAATG 102 60 

TCTATAAAGC CACCACATCC ATTTAGTCGG TCATTGAAGT AAGACACGTT GACATTGCCG 10320 

TATTGATCAA CCTCAGCAAA GCTAAGATAG GCAACTGATA CACCATTGTT ATAAATAAAA 10380 

TCCCATGCTC GATCATGAGG CATGCGCACA TCTGCAIXT AATTCATACC AAAATGTTCA 104 4 0 

CGACTCCCAA CGAATCCACC GAAAATGCCA ACATCTAAAA TCGGTTGCAC ATCATGTTCA 105 00 

ACACATTCTT CATGCAATAA ATTAGAGAGT TCATTATTGA TGCCATAACC GATGCTAATT 105 60 

GTATCGCCAT AAGTTAAAAA CTGAGCAGCA CGTCGGAGAA TCAATTTGCG ACTATTAAAA 10 620 

GGTAATGCGG GTTCAGGTAT TCCATCAATT CGTTCTTCTC CAGACAAGGC TGGTAAATAA 10 680 

TGACTTTGAA TTACTTGGCG GTGATTCTTT TCATCTTCTG TGACGTATAC ATAATCGACA 10740 

AGATTTCCTG GGATAACAAC TTCATTCGGT TTTAGTTGAT AGTCGTCAAC TAAAGCTTTA 10800 

ACTTGTACAA TAACTTTCCC ATGATTGGCT TTCGCGTTTA ATGCGACATG ATAACACTCG 10860 

CTCAAGTACG CTTCTTGAGT TAAATAAATG TTACCTTGTT GATCTGCGTA TGTTCCTCTC 10920 

AGTAGTGCCA CATCAACGCT AGGGAATGTG TAATGTAAGT ATGTTTCATC GTTGATGGTT 10980 

ACTAATGAAA CTAAAT CATC CGTTGTTCGT GTATTTACTT TACCGCACC GTATCTAGGA 11040 

TCAACAGCTG TGTTTAATCC GATTTTAGTA ATAACTCCAG GTAATAATTG ATTACTCTGA 11100 

CGATAATGAG TTGCAATGAT ACCTTGTGGT AAAAAATAAG CTTCAATGTC ATTATTTTTC 11160 

ATTGcTTGTG CCGTTTTGGA AGAAGCCGTT AAAAT AC T C A TAATGACACG TTTAATCATG 11220 

CGACGTTCTA T AAAAT CATC TAAATCCGGT GCGGCACCTA AACTATGAAT ATCATTCGCT 112 80 

AATATAAACG TTAAATCATT GGGCGTATGA TATGTGTCAT GTTGCGCTAA CACAGCACGT 1134 0 
AGAACTTCGG CGGGTAAGTT GGCTACAGCT AATGCTGGTA AACCAATCAC ATCACCATCT 11400

TTAATGATAT GTTGTAAGTC GTGCCATSTG ATTTGTTTCA AGCAAGTCAC CTCCATCACA 114 60 

TTTGATAAAA TATAGCGTTT TTACACTTTG TGTAAACCCT T a C AAGAAAT ATAACATAAC 11520 

GACGTTTAAA AT C AAT TAG A AATATCTTTT TATTCTGATA ATAGACACAG TATAGACACA 11580 

TTTTGATGGT CGATAACAAT TGTAATATCA AGGGTTTGTA ATGAATTGAA T AC ATT AAA 11640 

ATACTTATAT AAAAATATTG TTCGGAATAT AAAAAGTTAA ATAGGTTTTG ATTTTTAAAT 11700 

ATGAAATACA AAGTGCCCAA TCGAACAAAG TATTTATATT AAAATATGGA AAATCCATCA 11760 

ATATTAAATT AAAATAGTTT TATTATGAAA AG T GAAAG T A GGTAAGTCTA TGGAAGGTCT 11820 

TAATCATCGA AGAAATACAG AAAAAGAAGA GACAACACAA ACGCAATCaG TTGCACCTAA 11880 

TACAGGTGAA GAGGGGATGT CAT C AGC AAG TACACAATCA ACTAAGACGT CCGACATACA 11940 

TAATGAATCT ATCGATAAAC AAATGGAAGC TAAAGCGCAT GAAACAGCGC AAAATACAGA 12 000 

TTTAAAAAAC GAAGCAAGAA GTTTATTTGA TAAECAACC AAATCAATCG GTAGACTAGC 12060 

GGGCAATGAT GAAAGCTTAA ATCTTAATTT AAAAG AT AT G CTTTCTGAAG TATTTAAGCC 12120 

GCATACTAAA AACGAAGCAG ATGAAATATT TATAGCGGGT ACTGCTAAAA CTACGCCAGC 1218 0 

AATTTGTGAC AT AT C AG AAG AATGGGGGAA GCCATGGCTC TTTTCTCGAG TATTCATCGC 12240 

TTTCACAGTA ACATTTATTG GATTATGGGT CATGGCAGCA ATTTTTAATA ACACTAACGC 12300 

GATTCCGGGT CTCATTTTTA TAGGGGCTTT AACAGTACCA TTATCGGGTT TGTTCTTCTT 12360 

TTATGAATCA AATGCGTTTA AAAAT AT TAG CATTTTTGAA GTTATTATCA TGTTCTTTAT 12420 

TGGCGGCGTA TTTTCATTAC TAAGTACGAT GGTATTATAT AGATTTGTCG TTTTTAGTGA 12480 

TCAATTCGAA AGGTTTGGTT CTTTAACATT TTTCGATGCA TTTTTAGTAG GATTAGTTGA 1254 0 

AGAAACTGGA AAAGCACTCA TTATTGTTTA TTTCGTCAAT AAATTGAAAA CAAATAAGAT 12 600 

TTTGAATGGA TTATTAATCG GTGCTGCTAT TGGTGCAGGG TCGCAGTTT TTGAATCAGC 126 60 

AGGT TAT AT T TTGAATTTCG CTTTAGGAGA AAATGTCCCA TTATTAGATA TTGTCTTCAC 12720 

ACGTGCGTGG ACTGCGATTG GTGGTCATTT AGTTTGGTCA kCGATTGTTG GTGCTGCAAT 12780 

AGTTATTGCG AAAGAACAGC ATGGCTTTGA ATTCAAAGAT ATTTTTGATA AACGCTTTTT 12840

AATATTCTTT TTATCAGCCG TTGTTTTACA TGGCATTTGG GATACATCTT TAACTGTACT 12 900 

TGGCAGTGAT ACGTTGAAAA TATTTATTTT AATCGTTATT GTGTGGATAC TTGTATTCaT 12960 

TTTAATGGGG GCAGGTTTAA AACAAGTGAA TTTACTGCAG AAAGAAT T T A AAGAACAACA 13020 

GAAAAAAGTA GACGAATAAT AATTAAAGCT TATGTTGCTC ATATGTTTGT G AC AT AAG C T 13080 

ATTTTTATAA TTTGTCTTTA AAAGAGTGGA ATAGGAATAC TTTTTGGAGT TAAAAAAGTG 1314 0 
TTtCACGTTA AACAAATAGT GACAATTAGA TTTATATAAA ATGAACATGA TTCACTGAAA 13200

GTATGTAATA ATCATTTTAT T G AAAT T CAT CAAACAGAAA TTAATACA2S. CATATAAGCA 132 60 

AATTAAACCA CGCCATAATC ATATTGGATG ACTTCGGCGT GGTTTTTATA GTTGAAGCAG 1332 0 

GGCTGAGACA TAAATCAATG TCCCACACTC CCTTATCGTT CAATCGTTGT TCGATAATCG 13380 

ATTAAATAGA TACCTTCAGG TGTTACTTTA TAATTTTTAA CCTTAGAGTT AGCAGCGACT 13440 

ATTTGATCGT TGTAAGCAAT ATAACTGTTT GGTACATCTC GACTTGATAA TTTAATAATA 135 00 

T CAT TAG AAA TATTGTGACG TTCCTTAACA TCTACAGTAT GATTCAATTG ATTAATTAAA 135 60 

TCATCGACGT TGCTATTATT GTAGTCTCCT TTATTAATAG CACCATCTTT TTTATATGCT 13620 

TGATTAAAGA AATAACCTGT ATCTCCACGA GGAATTGTTC C G AAAC TATA CATCGTTGCA 13680 

TCCCATGCAG AACGGTCTTT TAAGTAACCT TCTATGTCAT CAACACTTTT AATGTCGATT 13740 

TCAATATTTG CTTTTTTAGC ATCTGATTGT AATACTTGCG CAATTTTCGA TAGCTCTGGA 13800 

CGACCGTCAT ACGTAATTAA CTTAATTTTT AAAGGGTGTT CTTTTGTATA ACCATCTTA 138 60 

GCTAATAACA TTTTTGCTTG TTCGATATTT TGTTTGGTTA ACTTAGGTTC TTTAATATAT 13920 

GGAATTTTAT CATTAAATGG ACTCGTTGCA GGTTTCGCAT AACCTTGATA AATATGATCT 1398 0 

GCAATACCTT GTCTATCAAT GAT AT GAT C T AATGCTTCAC GAACGGATTT AGTCATTTTt 14 040 

TTATtAGTAT GATTATACAT AAGT a AGAAG TTCTAAAn 14 078 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

TGAAAACTAA AGTGTTTCTA ATGCGTGACT AAAATTAGTA ATAATTAAGT TCTCATGATA 60 

ATAGGTATTT TTGAAAAATG GAGGAGTCTA TAAATGGGTA AAAAAATGGG TCTAGGTTTA 120 

TCTATTGCAT TGGTTGTTAT TGGTATTGCC GTTGTATGTT TAATGATTTT TTCTAGTCAA 180 

AAAACGACTT ATTTTGGTTA TATGAATAGT AATACAAATG C AG AAAAAG T TGTCAGTGAA 240 

AAAGATGGAT TAGTCAAACA TAATATCAAA G T AG AAC CAT CTAATGATTT CAAGCCGAAA 300 

AAAGGAGACT TTGTAAAATT AGTTTCTAAA GATGATGGGA AGACATTTTA TAAACAAGAG 3 60 

ATTGTTAAAC ATGATGACGT CCCACACGGT TTAATGATGA AAATTCACGA CATGCATATG 4 20 

AATTAATAAA AAAGCATCTA TAACGTAATT T TGAAGAAGT AG AG T TAT C T TCTTATGCGT 4 80 

TTTAGA 4 86 
(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

GAGGTCTATA TACAATTATG GTTGTTCCAG TTAAACGAAC TGATGGCTTT ATTACTAAGT 60 

TTAATAGATT AATTGAAAGA C GAT TAT T AC GTCATTTCAG TAAAAAAGGT TATATCACAT 120 

GGGAGGAAAA TTGATTGTCT GACATTTTAA AATGTATCGG TTGTGGTGCG CCACTTCAAT 180 

CTGAAGATAA AAATAAACCT GGTTTTGTAC CAGAGCAAA TATGTTTCGT GAT G AC G T G A 240 

TTTGCAGACG TTGTTTCCGC TTGAAAAATT ATAACGAATT C AAG AT G TAG GAT TAG AAA G 300 

TGAAGACTTT TTAAAATTAT TATCAGGACT TGCGGATAAA AAGGGTATTG TCGTCAATGT 360 

CGTGGATGTA TTTGACTTTG AAGGATCATT TATTAATGCA GTTAAACGTA TTGTCGGAAA 4 20 

CAAAAAAATC ATTTTAGCAG CAAATAAATT GGATTTATTG CCGAAACAAA TTAATAAACG 4 80 

TCGAGTTAAA GAATGGTTAA AACGAACAGC AAGAAAATAT GGTTTGGAAG CTGACGATGT 54 0 

CGTATTAATT TCAGCTGAAA AAGGCTGGGG CATAGACGAC TTATTATCAT CAATTGCGAA 600 

TATTCGAGAA AATGAAGATG TGTATATTGT AGGGACAACG AATGTTGGGA AATCTACATT 660 

GATTAATAAA CTGATTGAAG CTAGTGTTGG TGAAAAAGAT GTAGTAACAA C T T C AAG AT T 720 

CCCTGGAACA ACTTTAGATA T G AT AGAT AT TCCTTTAGAT GAAACATCAT TTATGTATGA 7 80 

TACACCAGGT AT TAT TC AAG ATCACCAAAT GACGCATTTA GTTJ5TGAAA AAGAATTGAA 84 0 

AATTATTATG CCTAAGAAAG AAATAAAACA ACGCGTATAT CAATTAAATG AGGCGCAGAC 90 0 

ATTATTCTTC GGCGGTCTAG CGCGCATAGA TTATGTATCA GGTGGTAAAC GTCCGTTAGT 960 

TTGTTTCTTT TCTAATGACT TGAATATACA TCGTACTAAm ACGGAGAAGG CTAATGATTT 1020 

ATGGCGTAAT CAACTTGGCG ATTTATTAAC GCCACCTGGA AAT CCACAAA ATTTTGATCT 1080 

TAATGAGGTA AAGGCTGTTA GACTTGAAAC AGGCAAAGAG AAACGCGATG TTATGATCTC 114 0 

TGGTCTAGGC TTTATAACTA TAGGACCAGG GGCTAAAGTA ATCGTTCGTG TTCCTAAAAA 1200 

TGTTGAkGTT G T AT T AAG AA ATTCTSTTTT ATAAGGTGaT TAAAAAAATG AAATTTGcAG 12 60 

TTATAGGAAA TCCTATTTCA CATTCCTTGT CGCCCGTTAT GCATAGAGCA AATTTTAATT 1320 

CTTTAGGATT AGAT GAT ACT TATGAAGCTT TAAATATTnC CAATTGAAGA TTTTCATTTA 138 0 

ATTAAAGAAA TTATTTCGAA AAAAGAATTa GAAGGCTTTA ATATCACAAT (TCTCATAAA 1440 
GAACGTATCA TACCGTATTT AGATTATGTT GATGAACAAG CGATTAATGC AGGTGCAGTT 1500

AACACTGTTT t G AT AAAAG A TGGCAAGTGG ATAGGGTATA ATACAGATGG TATTGGTTAT 15 60 

GTTAAAGGAT TGCACAGCGT TTAnCCAGAT TTAGAAAATG CATACATTTT AATTTTGGGC 162 0 

GCAGGT 162 6 



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AGGGTTAATT GTCGGTTTAA TTGCAATGAA TAAGTTCCAT GTATTAGCTG GCTATAGAGC 60 

G AAAT T CAT C TTAATGGTGA TTTTAACTAT GATGGTCTTC G T AC T TAT T A ATACGTATTT 12 0 

ACTAAGACAG GTAAAATCTA TCGGTATGTT CTTAATGATT GCTGCATTGG GTCTATACTT 180 

TGTAGCTATG AATAATTTGA AAGCAGCTGG ACAAGGTGTG ACTAATAAAA TTTCACCATT 24 0 

GTCTTATATC G AT AAC AT G T TCTTCAATTA TTTAAATGCA GAGCATCCTA TAGGCTTGGT 300 

GCTAGTAATA T T AAC AG T AC TTGTGATTAT TGGCTTTGTA CTGAACATGT TTATAAACA 3 60 

CTTTAAGAAA GAG AG AT T AA TCTAATGTTG ATGAATAGCG TGATTGCTTT AACTTTTTTA 4 20 

ACAGCATCTA GCAATAATGG CGGACTTAAT ATTGATGTGC AACAAGAAGA G G AAAAG C G A 480 

ATCAATAATG ATTTAAATCA AT AT GAT ACA ACGCTATTTA ATAAAGACAG CAAAGCGGTT 54 0 

AATGATGCGA TTGCTAAGCA GAAAAAAGAA CGACAACAAC AAATAAAAAA TGATATGTTT 600 

CAAAATCAAG CGAGTCACTC GACTCGCTTG AATGA 63 5 



(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CTGAAATGGG TATTATTTGT CTTCTTCATC ATAAAGTAAT AAAGATTGTT CATCATTGCG 60 

ACGTTGCCAA TTTTCATTTG GCGATCTTGG mCGTATAAT AATTGCCTAG TAGCATACGC 120 

TTTGATTGAA ACATACAAGT CATTCCTTGA ACTTGCACCT CAATTTCCAC ATTTGAATTT 18 0 

TCTTTTATAT TAATAATTTC ATCCAAATTC AGCTCACGTG CTAAGACAGC TCTTGATGCG 24 0 
CCTCTTTTAC CCCAGTAATT ACATTGAAAA TGATTAGTTA CTAACGTCTC TGCATTCAA. 300

TGAAGTGGTA TTGGATTTTC TTGCGCCTTC ACATACATTA CTACTGCTGG ATCCCCGAAA 360 

ATAATTCTGT CAACTCGTAT TTCATGTAAA AAATTAATAT AATCTTCTAC AGCATCTAAA 420 

TGATAATTAT GAAATAATCC ATTCACTGCC GCATATACTT TTTTATCGTT TTTGTGAGCT 4 80 

AATGCGACAG CCTCTGTCAT TTGTTGTCTA TTGAATTCCC CTGGAAGTCT TAAACCAAAC 54 0 

TTTTGCTCGC C AAT T AC AAA AGCATCTGCA CCTAAATCAA TAAGTGTTTC CATATGGCTT 600 

AATGACTTGG GTGTGACAAG TAATTCTGTC ATAGTCATTC TCCTTTAATT GAAATCGCTA 660 

ATCCATCGTC TATATTTAAA AAATTCGTTG TATATCCTG TTGCTTTATT AACCACTCAT 720 

TATAATCTTG AACCTTTTTA ACCATTTGTC TTACATTTCT CGATCTAACA ATCCCAATAT 780 

CCGATACAAA ACCGTGATAT AAAAC AT TAT CTGTAATTAC GAGACCTTGG TGCTTTAAAA 840 

GTGGTGTATA TATTTCAAAA AATTTCTTTG ATTGCGCTTT TGCTGCATCA AT AAAT AT C A 90 0 

TATCATAAAC TTTGTCATTT ACATTTTCAA ATTGCTCTAA AGCATTACCT TCAATAATTC 960 

GAACCTGGTT TTCAAAATGA TAAGTAGCTA AATTTTGTnT AGCATATTGA ATCATCGTTT 1020 

CATTACGCTC TATCGTTGTG ACATGAATGT CATCAGATAT AGAAGCGAAT TGCATAGAAC 108 0 

TATAGCCGAT TGCTGTACCA ATTTCTAAAA TATTTTTAAC ATTATTCATA CGAATTAATT 114 0 

GCTTAATTAA ATCTAATGTT AAACGATCTA CAATTGGCAC TTCATTTACC TCGGCAAATT 12 00 

CACGCAAAAC TTCGATTGAA CTATTTTGAT GTTGATGTAA ATCTATTAAA TATTTTTTAT 1260

TTAGGTCATC CATGTTTTAA ACTTCCTTTA TGTAAAATAA GTCAAMTGA TTATGACAAT 1320 

AAAATAAATC AGCCTTCACA ATTGATTATA ATTTTGCCAA CCAATTAAAT GACTGATTTC 1380 

GTGTTAGACG CAAAGCTATT TTATTTATAG AAGCGAATCA TTCATATAAA ATTTAACTTT 14 4 0 

AGATATTTTA CCATATTTTC AATAAAATTA TAAGCGTTAA TTATTTATAC ATTGCTTGAC 1500

TTAAAAAATA CTCTTGCCTC CCCATCTTTA AGGTTAGCAA GAGTAAAATC TTTTTAATTA 1560 

TTCTTCCATT TCAGTATTTA CAACTTCTTC AATCATGTCC CATTCTTCAT CAGTTTCGAT 1620 

TGGTACTAAC TTACCACCGT CACCTGACTC ATCTGGTTCA TTGATCATTG GTACAAGCTC 1680 

AATCATATCG TCTTCATCTG ATTGAGCSCC TTCTTCAGCT AAGATAACAT ACTCTTTTTT 17 4 0 

GAATTCAGGA TGATAAAATT CTAAAACTTT TCGGTATAAA ACTTCATTTC CCTCTTCATC 1800

GAATAAAGTT AATAATTCTT CTTCGTTATT AATTTCTAGT TGTGAATCAT GATTATGTTC 18 60 

AGTCATAGTA AAATCTCCTT TTAATGTAGT GAATCTAAAT AGCCTTGTAA AASAATACC 1920

GCTGCCATTT TATCAATCAC TTGTTTTCTT TTTTGTCTTG AAACATCTGC TTCTAATAAT 1980 

GATCGTTCAG CAGCCATTGT GCTTAATCTT TCATCCCACA TCACAATCTC AATAGAAGGA 2040
TAAGCTTCTA ATAATTTTTC TTTATATGTT AACGAAGCTT CGCCTCGAAA TCCTATTGAA 2100

TTATTCATGT TTTTAGGTAG TCCTATTACG ACTGTACCCA CATTATGTTT TTTAATAATG 2160 

TCTACTAATT GGTCAATACC TAATTCATTA TTTTCTTCAT TGATTCGGAG TGTGTCTAAT 2220 

CCTTGTGCCG TCCAACCCAT TATATCACTA ATTGCAATTC CTACCGTTCT ACTACCGACA 228 0 

TCGAGTCCTA AAATTTTATG TTGTAACATA AATTATTAT TTTGCTCTTT TAAATAGTAA 2340 

GAAACAAGCT CTTCCATAAT AACATCTCTA TCAATATGAC GAATTTGATT TCTTGCTTCA 2 4 00 

TTTTGGCGTG GAATATACGC AGGGTCACCT GATAATAAAT AACCTACAAT TTGGTTTACG 2 4 60 

GCATTATATC CTCGTTCATC TAATGTTCGA TAAACATTAT TTAAAACATC TCTTACATCT 2520 

TGCGTTGGAA GTTCTTCATA GTCGAATTTC ATTGTTTTAT CAAAGTTTTC CATTTGCGAC 2 58 0 

ACTCCTTTAA TTACAAATAT AACTCACTAT CATCATACAA TATTATGGCT TTAAATTATA 2640

GATTTTTAAT GTAATCTTTA ATAAAGCTTA ATGATTTTGA GATATTTTCA GGTTGTGTAC 2 70 0 

CGCCACCTTG AGCCATATCT GGACGACCGC CACCTTTACC ACCAACGATT GGTGCCATTT 2 7 60 

GTTTGATAAG ATCACCGGCT TTAACGTTAT TTGTTAAAGA TTTAGGGACA GTTGCAACCA 2 82 0 

TCGATACTTT ATCATCAACA TTACTTGCAA GAATGATAAT TGTATCTTGT AGTTTAGATT 2 88 0 

TAAAATCGTC CATTGTCGAG CGAATTGCTT TCGCATTTGG TAATCCACT TCAGTAACCA 2 94 0 

ATACTTTATA GCCATTGATT TCTTCAACTT GATCTTCAAT ATTACCCATT TTAAGTGATG 3000 

TGATTTCTTT GTCACGTTGC TCTAATTGTT TTAATAATGC TTTTTCTTCA TCTTGTAATT 3060 

GTGTTAACTT ATCGACTACT TGATCATCAG ATTTCACTTT CAGCTGTGAT TTCATCGTAT 3120

TAAATTTCTC TTGAATATCT TCTAAATATA AGAAAGCTGC TTTACCTGTT AATGCTTCAA 318 0 

TACGACGCAC ACCAGCTCCT GTACCTGACT CACTTACTAT TTTGAATAAG CCAATTTCAG 324 0 

AAGTATTGCG GACATGAATA CCACCACATA ATTCAATTGA AAATGGTGCC ATATTTACTA 3300

CACGCACAAC ATCACCATAT TTTTCACCGA ATAATGCCAT TGCGCCCATT TCTTTAGcTG 3360 

AAGCAATATC CATTTCTTGA ATGTTAACGT CAATACCTTT CCAAATTTCT TCATTTACTA 3420

AGCGTTCAAC TTGATCAATT TCATCATTAG TCATTGGACC AAAATGAGAG AAATCAAAAC 3480

GTAAACGATC TGCTTCTACT AGTGAACCAG CTTGGTTAAC ATGATCACCC AGTACTGATT 3540

TCAACGCTGC ATGTAATAAA TGTGTTGCAC TATGGTTCTT TTGAATGTCA CGTCGATCAT 3 600 

TTTGGTTCAC TTCAGCAGAC ACTGTAGCGC CAACATTTAC TTGGCCAAAT TGTACTACTC 3 6 60 

CTTTATGCAA GTTTTGACCA TTTGGTGCTT TGGTTACTTC ACTAACAGCA ATTTCAAAAT 3720 

TGTCATTATA AACAATACCT GTATCCGCAA CTTGTCCACC ACTGATTGCA TAAAATGGTG 3780

TTTCCGTTAA CATGAAGTAT ACTGTTTCAC CCGCTTCAAC TTGTGAAACT TCTTCACCAT 38 4 0 

TGTATATCAA GTGTGTTAGT GTTGTTTGAG ctGTCGCAGT ATCATAACCA ACAAAAGTAC 3900
TTGCAGATGT AATATTTTTC AATACTTCAC TTGAACTTG CATTGATTGA GAATTTTGAC 3960

GTGCTTGACG TGCACGATCA CGTTGTTGTT GCATTTCTGA CTCGAATGTT GTCATATCAA 4 02 0 

CTTTCAATCC TGCTTGCACT GCTATTTCTT CAGTTAATTC AATTGGGAAC CCATACGTAT 4 08 0 

CATACAATTT AAATGCATCT TTCCCATTAA TTTCATTTGT TGTCGCTTTA GCTTTTTAA. 414 0 

TTAATTCATT TAAAATCGCT AAACCATCTT CTAATGTTTC ATGGAATCGT TCTTCTTCAG 4200 

ACTTTATAAC ACGCTTAATG AAATCTGCTT TTTCCTTAAC ATTTGGATAA TATGGTTCCA 4260 

TAATGTCTGC AACAATATCA ACAAGTTTGT ACATAAATGG CTCATTGATT CCTAACGTTT 4 320 

GACTAAAACG AAGGGCACGA CGTAACAATC GACGTAATAC ATACCCTCTA CCTTCATTGG 4 38 0 

CAGGTAATGC ACCATCAGAA ATTGCAAATG CAATCGTACG AATGTGGTCA GCAATTACTT 4440

TAAATGCCAC ATCTTGTTCG TTGTTTACTA AATATTGTTT ACCTGATACT TTTTCGATTT 4 500 

CATTCATTAT AGGCATAAAT AAATCTGTTT CATAGTTAG 1 ACGTACATTT TGAGAAACTG 4 5 60 

AGGCCATACG CTCAAGCCCC ATGCCAGTAT CAATATTTTT ATTAGGTAAT GGTGTGTAAC 4 62 0 

TATGATCTTT ATTATGATTG AATTCACTAA ATACTAAGTT CCATACTTCA AGATAGCGTT 4 68 0 

CATTTTCTCC ACCTGGATAC ATTTCTTCTG CCGGATCGTC TTGTCCATAT GCTTCTCCGC 4 74 0 

GATCATAGAA AATCTCAGTG TTCGGTCCTG AAGGCCCTTC ACCAATATCC CAGAAGTTAC 4800

CTTCAATGCG AATAATACGA CTTTCTTCAA GCCCAATATC TTTATGCCAA ATGTTGTATG 4 8 60 

CTTCCATATC TTCCGGATGA ATCGTAACGT ACAATTTATC TGGCTCCATA CCCATCCATT 4 92 0 

TATCACTCGT TAAAAATTCC CAAGCAAATT CAATCGCTTC TTGTTTAAAA TAATCACCAA 4980 

TTGAGAAGTT ACCTAACATT TCAAAGAATG TATGGTGACG CGCTGTGAAA CCAACATTTT 504 0 

CAATATCATT TGTACGAATA GCTTTTTGAG AGTTTACAAT TCTTGGCTTT TTAGGTGTTt 5100 

CACGTCCATC AAAATATTTC TTTAATGTTG CTACACCTGA ATTAAT.CAT AATAATGTAT 5160 

CATCATCAAT TGGCACTAAT GGTGCAGAAG GTTCAACCAT ATGTCCTTTT TCAACAAAGA 52 2 0 

AATCTAGATA TTTTTGTCTA ATTTCACTCG CTTTTAACTT TTTCATCATT TACACATCCT 52 8 0 

ATTTACTGTT TTTAAATTAC CATTCCATAA AAATTGATGA CACAGATAGT CGATTTGCAA 534 0 

AACTAGTATA AATCAATATC ATTTTTTATT ATTAAAAAAT AAAAAACGCC CATCCTCAAA 54 00 

AGGGACGAAC GTTATCGCGG TACCACCCTA GTTATAAATG CAATTCAACA CATTTATCAC 54 60 

TTTAATTCGA CTATACAGTT GTGCATAAAG TAGCGTTCAC TAATGTTTGT tGTACTTTTC 5520 

ACCAACCAGT ACATCTCTGA TAAACAAiTC aTTAACTACT CATCTTTATA CGAATTTAAT 5580

TCTATTTTAG TTACATTTAC GCTTGTTGTC AACGTTCTAT AAAGTCATAC GGCGTGATTT 5640

CTCCCATATT AATCATTGGG TCAATTTTAA ACATTGTAGC TTCCGTTAAT ACATTTGTAT 5700 
CTGTTTTTGT TGAATCAGAC ATAACTTCTT CACTATCATT CGATGACATT GGGCTTCTA 5760

CTTGATCATC TATTGTCGTT TGTGAAGCTC CTGTATCATT AgTTGCTGTG TTTTCCAgCA 5820 

TTTCTTCATC TTCTGAATTA AAATAATTTT TCAACAATGT ACATAATTGT GTTAAACGCG 5880 

CTTGACCATT TGTTTTCAAT CCAATATCAA ATGCTTCCGG ATCACCAAGT AAAACTAAAC 5 94 0 

TCGTTTTCGC TCTAGTTAAA CCAGTATATA ATATCGGTCT TTGTAACATT CTAAAATACT 6000 

GTTTAACAAT AGGCATGATA ACAATAGGAA ATTCTGAACC TTGTGATTTA TGGATTGATG 6060 

TACAATAAGC ATGTGTTAAT TCCATCATAT CTTGTTTCGT AAATGTAATT TCATTACCTT 612 0 

CAAAATCCAC AACAAGTACA TCTTTATTAA GGGCATTTC TTTCGCCCAA AAAATACCAA 6180 

CAATAACTCC TATGTCACCA TTGAATATGT TATCATTTGG CCTATTAACA AGTTGTAATA 624 0 

CTTTGTCACC TTTTCTAAAG ACTACATCAC CAAACTCAAT TTCTCGTGTG TCTTTCTTTT 6300 

TAGGGTTTAA AATATCTTGT AAAACTTGAT TTAAACGTTT AATACCGGCA TTTCCTTTAT 63 60 

ACATTGGTGC AAGCACTTGA ATATCAGCCA TAGTATACCC TTTATTAACA GCACTAGTAA 64 2 0 

CTACCTTCTC AACAACTGTT GGTATTTGGT TTGCCTGACA GTTAATAAAA CTTCTATCAT 64 80 

GAAAACGCTG TGTAATATCA ATTTTCTGAC CCAACTTCAT TCGATGTGCT AATTCTATAA 654 0 

TGCTTGAACC ATCTTGTTGA CGATATACTT CAGTCAGATT TACTCGTGGT ATAGCTTTCG 6600

ATTCAATTAA ATCTTTAAAT ACTTGACCAG GACCTACAGA AGGCAATTGG TCCTCATCAC 6660

CTACAAATAT CAATTGTGCA TCTAAAGGAA CTGCACTTAA AAATTGGTGG AACAACCAAG 6720 

TATCTACCAT AGACATCTCA TCAATGATTA TGAGTCGTGC GTTATTTCA TTTTCTAATA 6780

TATCCTCTGG CTTTGTGTCT TGATTCCAAC CTATTAAACG ATGAATCGTC ATTGCTTCTA 68 4 0 

ATCCAGTTGA CTCTTGTAGT CTCTTAGACG CTCTTCCTGT TGGCGCTGCT AATACAACTG 6900 

GATAATCATC ATTGACATAA TCATCATAAT CTAATGATAA GCCATGAATC TCAGCATATA 6960 

ATTCAACAAT ACCTTTAATT ACTGTCGTTT TTCCTGTTCC CGGTCCACCG GTTAATAGCA 7020 

TCACCTTAGA ATTGATAGCC GTTTGCAAAG CTTCTTTTTG TGAAGCTGCA TAGTTCACTT 7080 

GATTCGCATC TTCTATTTCA CCAATATGCA TTTGTAAATC TGACTGTTCA ATTTCTGTAA 714 0 

GTTTATTTGT ATGCGTCTTT ATTCTGAATA AGTTTTGAAC ACTTTTGATT TCaGAATAAT 7200 

ACAAACTTGG AATTGCAACT TGTTCaTTGT CAATAATTAG TCGTTTTTCC TCATTTAAGT 72 60 

ATTGCAACAT TTCGTCTAAT TTTTCAGGTT CGATGACCTC TTCATCTTGa TAATTTAATA 7 320 

CATCAACCGT TAAATCTATA ACAACATTGA TAGGCAAATA TGTATGTCCC TGTTTAATAC 7380

ATTCTTCTTC TAACGTATAG AGCAACGCAG CTTTTAATCG TTCATTATCG TTATAAGCGA 74 4 0 

TACCAATATT TCTAGCAAGT TGATCTGCTT TATTAAAACC AATACCTTTA ATATCATAAA 7 500 

TCAATTGATA TGGATTTCGA TCTAAAATAG TCAGTGTATC GCCGAGATAA AACTGATAAA 7560
TTGCCATTGA AAGTTTAGGA CCAAACCCTA AATCATGTAA ACGAATCATT ATTTTTTCAG 7620

ATTCTTGATT TGCTGAAATT TGTTCTGCAA TTTGTTTCTG TTTCTTTTTA GATAATCCCG 7 68 0 

AAACTTTTTC TAGCACTGAA TGGTCATCTA ATATATCATT TATCGCATTG TCACCTAATG 77 4 0 

TATTAACAAT ATTTTGAGCT GTCTTTTTAC CTACACCTTT AAACAAATCA CTAGATAAAT 7800
AACTTATAAT TGCTTCTTTC GTTTGTGGCA TTTCTTTTTC AAAAGTCTCT GCTTTTAATT 7860 
GTTTACCATA ACGTGGATGA TCAACAACTT GCCCTTTAAA TGTGTAGACA TCGCCTTCAA 7 920 

CAATATTCGG AAGAAACCCT ACAACAGTTG GCATTGTATC AAAGTCTTCA TTTGTTOAA 7980
TAGTATCTAC TTTAAGCACT GTATAAAAAT TATCACTGTT TTGAAACAAT ATCGCTTCAA 8 04 0 

CAGTACCTTT GATCATTGAA TAATCAAATA GTGTAGGGTC TGACATGTTA CTCCTCCTCT 8100 
TTCATTTTAG TGAATGTTTT CAGCGCATGC TGACTTAATA AGTGTTTAGG GTCGATAGTC 8160 

ACAGCTTCTT TAAAATGAGT TATTGCTTCA TCAATATCTT CATTTTTCAT AAATAACGCT 822 0 
AAGCCCAAAT TGTATCTTGC ATCAACATGA TTTTTATCAA TCGTTAATAC ATGTTTAAGT 8280

TGAGTTATGG CTTCATTAAA CATTTCTAAT TGACATAATA CAAGACCATA TTGAAATTGA 8 34 0 

ACTTCTGCAT CTTTGTCTTT ATCTAGTTCC GCAGCAGT& TTAAATACGG CAATGCCAAG 8 4 00 
CTTAAATGAT TCTAACTGAT TAAACGCCAT ACCGATCATA TAATTACAAT CAACTTGTTC 8460

AATCTCTGTT TGTAATGCTT GTTGATATAA TTTAATAGCT TCTTGATAAC GTTGCTGATT 8520

ATAATATACA TTTGCTAGAT TAAAAAATAC GACGCCATTC TTCGGATCTA TTGTnAAAGC 85 80 
TTTTTGGAAA AAACGCTCTG CCTTTyCAAy CyCATTCgCA TCAGCAAGTA CGATmCCaGC 8 64 0 

ATTAATATAA TTTTCAATAA TTGTAGGATT TTCTTCGATA TTTCCGAACA ATGCTTGTAA 87 0 0 

CGCTTCTTCT ATTTTTCCAT TTTGTATGTA TTGATAAATT GTTTGTTGAT CTATCATTTA 87 60 

CGAACCTCAT TTCTCATCAA TTATAACATC TTGATAAATT GTATGTCTCG AATCACTTAA 8820 

CAACGAATAA AATATAATCT AATATCATCT TCATTCATGA AAAAGCGGGA ATGGAATAGA 888 0 

AATGCTTAAG AACCATTAAC GGTTTATTAT GTAATGGTTC TTCCACATTA GCCACCACTA 8940

TTATGTACTT AAAAATAAGA ATACATAATT AGATTCATGC ATAGGfiGTG GGACAGAAAT 9000 
GATATTTTAA CAAAATTAAA TTCGTTATCC CCAACTGGCA TTGCCTGTAG AATTTCTTTA 90 60 

CGAAATTCTC TATGTTGTGG TCCCGCCAAT ATAACATTGT AGAGCCTAGG ACATTGTGAT 912 0 

GTCCCAGACT CTATCCTCAT GAATTATTCT CATCAAAAAC TGTCTTTCGT CATTTTCAAC 9180 

GTTGAAACTT CAAATAAGTA ATTTATTGTT GCCATTGTTT ATACAACATA ATTTAATTGA 9240

CCTTCATTTT TGAACACATC GTCAATTGTT GCACCACCAA GACACACATC ACCTTGATAA 9300 

AAAACAACTG CTTGTCCAGG TGTGATTGCT CTTACTGGCT CAGCAAAAGT AACACGTAGg 9360 






CAtGGtCGTT TTCACGTTTC ACAAAAACTT TCGTATCTTT TTGGCGATAT CTAAATTTAG 9420 

CTGtACATTC AAAACCTTGA TCTAAGTCAT TATCTTCTGG ATTTACAAAT GAATAGTCTG 94 8 0 

AAGCAATTAA GTAATCACTG TATAATGCAT CGTGATGGAA TCCTTGTTCT ACATATAAAA 954 0 

CATTATCTTT TAGGTTTTTA CCGACAACAA ACCAAGGATC GCCATCTCCA CCHTACCTA 9600 

ATCCATGTCT TTGTCCTATT GTGTAATACA TCAAACCACT ATGTTTACCC ATTTTCTTAC 9660 

CATCAAGTGT TATCATATCA CCCGGTTGTG CAGGTAAATA TTGTGATAAA AATGTTTTAA 97 2 0 

AGTTTTTTTC GCCGATAAAA CAAATGCCTG TAGAATCTTT TTTCTTAGCA GTAACAAGTC 97 8 0 

CTTGTTCTTC AGCAATTCGA CGCACTTCAC TCTTTTCGAT GTCGCCAATT GGGaACATCA 98 4 0 

CTTTTGAAAG TTGTTGTTGA GATAATTGAT TCAAGAAGTA TGTTTGATCT TTATTATTAT 9900

CTACACCACG TAACATTTCA ACATGACCAT CTTCATGACG ATGTATGCGT GCGTAATGTC 9 960 

CTGTTGCTAC ATAATCTGCA CCTAAATTCA TCGCffGATC TAAAAAGGCT TTAAACTTAA 10020 

TTTCTTTATT ACACATAACG TCTGGATTTG GAGTACGACC TTTTTTGTAT TCATCTAAGA 10080 

AATACGTAAA GACTTTATCC CAATATTCTT TTTCAAAATT AACAGCGTAA TACGGAATGC 10140 

CAATTTGATT ACACACTTCA ATAACATCGT TGTAATCTTC AGTTGCAGTA CATACGCCAT 10200 

TTTCGTCAGT GTCATCCCAG TTTTTCATAA ATATGCCAAT GACATCATAA CCTTGTTCTT 10260

TTAAGACGTG GGCTGTTACA GAACTATCTA CACCGCCTGA CATACCAACG ACAACACGTA 10320

TATCTTTATT TGACAATTAT GACTCCTCCT TAAATTTAAA ATATATTTTA TGAATTTCAG 10380 

CTACAATTGC ATTAATTTCA TTTTCAGTAG TCAATTCGTT AAAACTAAAT CGAATCGAAT 10440 

GATTTGATCG CTCCTCATCT TCGAACATTG CATCTAAAAC ATGCGACGGT TGTGTAGAGC 10500 

CTGCTGTACA TGCAGATCCA GACGACACAT AGATTTGTGC CATATCCAAC AATGTTAACA 10560

TCGTTTCAAC TTCAACAAAC GGAAAATATA GATTTACAAT AEGCCTGTA GCATCCGTCA 10 620 

TTGAACCATT TAATTCAAAT GGAATCGCTC TTTCTTGTAA TTTAACTAAA AATTGTTCTT 10680 

TTAAATTCAT TAAATGAATA TTGTTATCGT CTCGATTCTT TTCTGCTAAT TGTAATGCTT 10740 

TAGCCATCCC AACAATTTGC GCAAGATTTT CAGTGCCTGC ACGGCGTTTC AATTCTTGTT 10800 

CACCGCCAAG TTGAGGATAA TCTAGTGTAA CATGGTCTTT AACTAGTAAT GCACCGACAC 108 60 

CTTTTGGTCC GCCAAACTTA TGAGCAGTAA TACTCATTGC GTCGATCTCA AATTCGTCAA 10 920 

ACTTAACATC AAGATGTCCA ATTGCTTGAA CCGCATCAAC ATGGAAATAT GCATTTGTCT 10 980 

CAGCAATAAT ATCTTGAATA TCATAAATTT GTtGCACTGT GCCAaCTTCA TTATTTACAA 1104 0 

ACATraTAGa TACTAAAATC GTCTTATCTG tAATTGTTTC TTCAAGTTGA TCTAAATCAA 11100 

TAGCACCTGT ATCATCAACA TCTAGATATG TTACATCAAA ACCTTCTCGC TCTAATTGTT 11160

CAAAAACATG TAACACAGAA TGATGTTCAA TCTTCGATGT GATAATGTGA TTACCCAATT 11220
GTTCATTTGC TTTTACTATG CCTTTAATTG CCGTATTATT CGATTCTGTT GCGCCACTCG 11280

TAAATATAAT TTCATGTGTA TCTGCACCAA GTAATTGTGC AATTTGACGT CTTGACTCAT 1134 0 

CTAAATATTT ACGCGCATCT CTTCCCTTAG CATGTATTGA TGATGGATTA CCATAATGCG 114 00 

AATTGTAAAT CGTCATCATC GCATCTACTA CTTCAGGTTT TACTGGTGTG GTCGCAGCAT 114 60 

AATCTGCATA AATTTCCATG TTTGGACACT CCTCACAATT TTATCAATGT TCCAATAATA 11520 

GCACCTTACA TACTATTTTT CTACTTTTCT GTTTAACTTT ATTTATAATG TTTTTAATTA 11580 

TATTTTACCA TTTTCTACAC ATGCTTTTCG JTAGGCTTTT TTAAGTTTAT CGCTTTATTC 11640 

TTGTCTTTTT TATAAATTTT AGTATTTGCA GATATTTTTT TATTTGTAAA ATGTAACGTA 117 00 

CTATTATTTT GGTTATGAGC AATTTAATAT TTATCTGGTT ATTCGATTGG TATACTTCTT 117 60 

ATATCATAAA AAAGGAAGGA CGATATAAAA ATGGCGATTA AATATTCAGC ATTAAACTT 11820

GTCCCTATTC GAGAAGGTGA AGATGAACAA ACAGCAATTA ATAATATGGT TAATCTCGCA 118 8 0 

CAACATTTAG ACGAATTATC ATATGAAAGA TATTGGATTG CTGAACACCA TAACGCTCCC 1194 0 

AACCTAGTAA GTTCAGCAAC TGCTTTATTA ATTCAACATA CGTTAGAACA TACGAAACAC 12 0 00 

ATACGTGTAG GTTCTGGAGG CATCATGTTA CCTAATCATG CTCCATTAAT CGTTGCGGAA 120 60 

CAATTTGGCA CGATGGCAAC ATTATTTCCA AATCGTGTCG ATTTAGGATT AGGACGTGCA 12120 

CCTGGAACAG ATATGATGAC CGcAAGTGCA TTAAGACGAG ATCAACATGA TGGTGTTTAT 12180

AAATTTCCAG AAGAGGTTTC ATTATTACAA CAATATTTG GCCCTGCTCA CCAACAAGCA 12240

TATGTTCGTG CTTATCCAGC AGTAGGTAAA AATGTGCCTT TATACATTCT TGGTTCTTCA 12300 

ACAGATTCTG CACATTTAGC TGCTCGCAAA GGGCTTCCAT ATGTGTTCGC TGGACATTTT 12360 

GCACCTCAAC AAATGAAAGA AGCTATCGAA ATTTACAAAA CGTTATTTGA ACCTTCTGAT 12420

GTATTAGACG AACCTTATGT TATTGTATGT TTAAATACAA TCGTTGCTGA AAATGATGAC 12480

GAAGCACAAT ATTTAGCTTC ATCTATGGCA CAAGTAATGG TTAGTATCAC TCGTGGCAGA 1254 0 

ATGCAGCCCG TTCAACCGCC AACACATGAA CTACAAAATA TATTAACGCC GAGAGAATAC 12600

GCGATGGCTA TGGAAAGACA GAAAATATCA TTAATAGGTT CAGAAAATAC TGTTCAACAA 12660

AAAATTCAAG ATTTTATGGA AACTTATGGT GAAGTCAACG AAATTATGGC AATAAGTTAT 12720 

ATTTATGATA AAGATATGCA ATTAGACTCT TATCGTCGGT TCAAGAATGT TATAAATCAG 127 80 

ATAAATGAAA AAAACACTTT ATAATGTGAT AAATAAACTA AGTGAAGTA TGTATCCATA 128 4 0 

ATATTAATAA AAATATACAG TAACAGCATT TTGAATGAAA GATGTCTTTA TTGTTCAATC 12900

ATTTATTTTA GTAATGATTC AAATTCACTT AAAATyCTAA tGCAAATATG AAAGCGCCCC 12 9 60 

TTCAcTTTAC ACTGTGTAAG TGTTTATTTG ATGGGGCGCT TTCAAAATAT TGAAAAGCAT 13020 
ATCCAAAATT TAAAGAAATT TATTTCTCTT TATCTTCATT TTCTTTTTTC TCTTCGTTAT 13080

TCGATCCTGT ATATTCATTT ATCTTATCTT TTACATTTTT AACTTGTTCA TTATCGCTAT 1314 0 

TTTTAAATTT TTCTACGCGT CTTTAGCTTT ATCCATAAAA CTCATATTAA TCGCTCCTCT 13200 

TATATTTGAT TAGTTTAATT GAACTTAHT TTTAAGTTTA TCAATTGCAT CAGTTATTTT 13260 

GTTTTTAGCA TTTTCAACAA CTTCTTTTGC TTTaCCAGTC GCTTTATCTT GCTGACCTTC 13320 

TTTTTCTAAT TCTTTGTTAT CAGTAACGTT ACCTACTGTT TCTTTAACAT TTCCTTTAAA 1338 0 

TTGATCGAAC TtACTTTCGT CTGCCATAGT GAAACCTCCT TGGATGTATA TATTATATA 13440 

CCACTAAGGA GGTTCGCTmm mCAyymyAAT ATGAAGTTTT TATGTTATAG TATAGTATTT 13500 

ATACGATTAA ATATAAAACA TGTATCCGTC TAAATCTTCA CTTGTATCTA CATATTCCGC 135 60 

TAAATATTTC AATGTTGTAT TATCTAAAAC ATCTCTCACT GCATCTsTCA TGCGAATCCA 13 620 

TAGTTGTTTT TGCGCAGGTG GTTCTGATTC AATACTTTCA ACAAATGTAA TTGGACCTTC 13 680 

TAACAGTCTT ATAATATCCC CTGCTGAGAT TTCTT 13715 
(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 873 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

AAATCCATAA TGTCATGATA ATCTGCATAT GCTTCATATA ATTCAATCAT TGTGAATTCA 60 

GGGTTATGTC TAGTTGATAC ACCTTCATTA QSGAATACTC TACCAATTTC ATATACTTTT 120 

TCAAGTCCAC CGACAATTAm ACGTTTTAAA TGCAACyCAA TAGCAATACG CATGTATAaC 18 0 

GTTGCATCTA ATGCATTATG ATGTGTTACA AATGGTCTAG CAGCTGCTCC ACCAGCAATT 24 0 

TGGTGCATCA TAGGTGTTTC TACTTCCAAG AAACCTTTAT TATTTAAATA ATTACGSTT 300 

TCTTGAATGA TTTTACTACG ATTAATAAAT GTACGAGTGC TATCTTCGTT CGTAATTAAA 3 60 

TCTAAATATC TTTGACGATA tCTCTGTTCA ATATCCTGTA AACCGTGGAA TTTATCCGGT 420 

AATGGTCGCA ATGATTTAGT TAGTAGCGTG AATTTCTTCG CTTTAACCGA TAATTCGCCA 4 80 

GTATTTGTTT TGAA.CATTAC ACCTTCAACA CCAACGATAT CGCCTAAATC AGCATTTTTC 54 0 

CATAAATCAA ATTCGTCATC GCCAACTTGA TCTTTACGAA CGTAAATTTG AATTTGTCCA 60 0 

GCTAAGTCCT GAACGTGTGC AAATCCTGCT TTACCTTTAC CACGCTTAGT CATTAATCGT 660 

CCAGCTATAG CGACATGACT ATCCGCTTCT TTTTCTAC& ATTCTTCTTT AGAATACTGG 720 

TCCCACTCTT CTTTCAAATC ACTAGATAAA CCTGAACGGT CAAATTTAGA ACCAAACGGG 780 
TCTATACCAA GATCATATAA TTCTTGTAAT TtTTGACGTC GAACCAACAT TTGGTCATTC 840



ATTTCTTCTG ACATAACTtT CTCTCCTTTA ACT 873

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

TTGTCTACTT CCTCATAGcT TTCAATACGT TTATCAAATT CTTCTTCTGT TTCTCCAGGA 60 

ACCATAATAT GAATGGCTTC AGGATCAAAA TAAAGACCAA CTTCACTGCC TACTTCAGCT 120 

TTTTTAGTCG TTTGTATTAC CCATTCATAA CCTTTATTGT CTATACAACA TATTTCATAG 180

TGGACCCCTC TAAATAACAT AGAATCAACA GTTGCTTTAA ATAATCCTTC TTCAGCTTTG 24 0 

ATTAATGATA TATCTTCTGG TCGAATAACG ACTTCTACTT TTTTATTTTC AGGAATACCC 300 

ATATCGACAC ATTCGAAATC TTGCCCATAA ATATTCACGA CATAATCTCT AACCATGCGC 360

CCTTCAACAA TATTAGATTC TCCAATAAAA TCAGCTACAA AEGATTCAC TGGTTCGTCA 4 20 



TaTATATCTG TTGGTGTGCC AAATTGTTGA AT 452

(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2308 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

TAGGTTGGGT TCTAACATAC GATAAAGCTC AACAAATCAA CACAGCTTTC TTTGTAAAAT 60 

TGTTTAATAC TGCATTAGCA GAACGTGATT ATTATTTTAA TATAGATGGA ACAAATGCTT 120 

TTAGATTATT TAATGCTGAA GGTGATGGTG TTGGGGGATT AACAATCGAC AATTACGATG 18 0 

GTCATTTGTT GATTCAATGG TACTCAAAAG GTATTTATAA ATTTAAATAT GCCATTCTTG 24 0 

AAGCGGTTAG AAAAGTATTT GATTATAAAT CTATTTACGA AAAAGTAAGA TTTAAAGACA 300

GCGAATATAG TGGTGGTTTT GTTGAAGGAG ATGCACCTGa GTTTCCAATT GTTATCGAAG 3 60 

AAAACTTCAC ATTTTATAAT GTAGACCTTG AAGATGGTTT GATGACAGGT ATCTTTTTAG 420 

ATCAAAAAGA AGTGCGCAAG AaATTAAGGG ATCAATATGC CAAAGAACGC CATGTTTTAA 480
ACTTATTTAG TTATACAGGT GCTTTTTCTG CAATAGCAGC AAGTfiGGCA TCTTCAACAA 540

CAAGTGTAGA TTTGGCTAAT CGTTCTCGTA GTTTAACTGA AGAAAATTTT GGATTAAATG 600 

CTATTGATCC TAAATCCCAA TATATTTATG TCATGGACAC TTTTGATTTC TATAAATATG 660 

CTGCACGACA TGGACATAGT TATGACACGA TCGTGATTGA TCCACCTAGC TTTGCGCGTA 7 20 

ACAAAAAACG TACATTTTCA GTGCAAAAAG ATTATGACAA ATTAATTAAT GGCGCCTTAA 780

ATATCTTATC ATCTGAAGGA ACATTATTGT TATGTACAAA CGCAAGTGTA TATCCATTAA 84 0 

AGCAATTTAA AAATACTATT AAAAAGACGC TTGAAGAGAG TGGCGTTGAT TATGAATTAA 900

CTGAAGTTAT GGGATTACCA AAAGATTTTA AAACGCATCC ACATTATAAG CCATCTAAAT 960

ATTTAAAAGC TGTTTTTGTA AATATTAGAC ATTAATTTGA TAAAAAGGGG TATAGTAACT 1020 

TATTGAGAAA AAGAAGGGTG ATAATATTAT GGGATTCAAA AACAATTTAA CATCAAATTT 1080 

AACAAATAAA ATCGGTAATT CAGTCTTTAA AATAGAAAAT GTTGACGGAA A&GTGCAAT 114 0 

GCCAACGACG ATTCAAGAAT TGAGAGAAAG ACGACAACGT GCTGAAGCAA TTGTAAAGAG 1200

AAAGTCTTTA ATGTCATCAA CAATGAGCGT TGTTCCAATT CCGGGTTTAG ATTTTGGTGT 12 60 

TGATTTAAAA TTAATGAAAG ATATTATCGA AGATGTTAAT AAAATTTATG GTTTAGATCA 132 0 

TAAGCAAGTT AATAGCCTTG GGGATGATGT GAAAGAAAGA ATTATGTCTG CAGCAGCAAT 138 0 

TCAAGGTAGT CAATTTATTG GTAAAAGAAT TTCAAATGCA TTTTTAAAAA TTGTAATTAG 14 4 0 

AGATGTAGCT AAACGTACTG CTGCAAAACa AACAAAATGG TTTCCTGTTG TAGGACAAGC 1500 

TGTGTCTGCA TCTATTAGTT ACTATTTTAT GAAAAAATT GGAAAAGATC ACATTCAAAA 1560

ATGCGAAAAT GTTATTAAAA ATGTCATGTA GGTGCTATAA TAGTTTTGCA ATTTGCAAAT 162 0 

TTTACTGAAA CCGGTTTTAA ACGAATTGAA TTTAAAGcAT GGTTTTGGTA AAGTTAATGT 168 0 

ATAAAACTAA GTTAGyATTG TAATAATATk GAAGATTCTA ACTATACGAA GGAGAAATGT 1740

AATTATGGAA CAAAATTCAT ATGTAATCAT CGACGAGAmT GGTATTCACG CTAGACCAGC 1800

AACAATGTTA GTACAAACAG CTTCAAAATT CGATTCTGAT ATTCAATTAG AATATAACGG 18 60 

TAAGAAAGTA AACTTAAAAT CAATCATGGG TGTTATGAGC CTTGGTGTTG GTAAAGATGC 192 0 

TGAAATTACA ATTTATGCTG ACGGTAGTGA TGAATCTGAC GCCATTCAAG CAATCAGTGA 1980

CGTCTTATCA AAAGAAGGAT TGACTAAATA ATCATGTCTA AATTAATTAA AGGTATTGCC 2040

GCATCTGATG GTGTCGCAAT TGCTAAAGCT TATTTATTAG TTGAGCCAGA CTTAACATTC 2100

GACAAAAATG AAAAAGTCAC TGATGTTGAA GGAGAAGTTG (SAAGTTCAA TAGCGCTATC 2160

GAAGCTTCTA AAGTTGAGTT AACTAAAATT AGAAATAATG CAGAGGTTCA ACTAGGTGCT 2220 

GATAAAGCTG CTATCTTTGA TGCAcaTTGG GGGGTGGTAG ATGACCCTGA ATTAATTCAA 228 0 

CCAATCCAAG ATAAGATTAA AAATGAAA 2308
(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5559 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

AAGTAATAAA TCGTCTCATT TGGCAACTGA CGCATAATTT CTTTAGCTAC TGTCAAACCT 60 

CCGACACCAG AGTCTATTAC ACCTATTGGT TTATTCATAT TCGGTCATCC TTACTTTTAA 12 0 

TTTAATTTTT AGTTTATCAT AACTAAGCAT TGGATTTTAG TATTATGCAC TGTGTTTACC 180

ATTTTTGTCA TTATAATATT TATTTTAAAT CAGCCCACTA TCATATTGTC ATGTAATCTG 24 0 

CTTATTAAAA AAATCCCTTC CAAGTTATTG TGTATCTCCA TTCAATTTAA TTTTGAAAGG 300 

AACATAACwT TTTAACTCAA AAGGGATTAA TTTnTAnTCT ACTTCATGGT CTGAACCAAA 360 

GAATGATTTA AACATGTGGA ATGTTGTTTC TCTGTTCATT GCTGAATGG ATGTTGTTAA 420 

TGGAATACCT TTAGGGCAAG CATTAACACA GTTTTGTGAA TTACCACACT GCTGTAAGCC 480

ACCAGTACCC ATTAATGCAT TTAAACGTTC ATCTTTAGTC ATAGATCCTG TTGGGTGCAA 54 0 

ATTAAACAAA CGAACTTGCG AGATTGCTTG TGCACCAACG AaTTTATTAT TTTCAGTAAC 600

ATTAGGACAA ACCTCTAAAC ATACACCACA TGTCATACAT TTAGATAATT CATAAGCTGT 660

TTGACGTTTT TTCTCTGGCA TACGTGGTCC CGGACCTAAA TCATACGTTC CATCAATTGG 720 

GATCCATGCT TTCATACGTT TTAAGTTATC GAACATTCTA GAACGATCAA CTTGTAAGTC 7 80 

ACGGATAACT GGGAAAGTAT TCATTGSCTC TAAACGAATA GGTTGTTCTA ATTGATCAAC 840

AATCGCAGAA CAAGATTGTC TTGCACGACC ATTGATAACC ATAGAACATG CTCCACATAC 900

TTCTTCTAAG CAGTTCATAT CCCAGACAAC AGGTGTTGTT TTTTCACCTT TAATATTAAC 960 

TGGGTTACGT CTAATTTCCA TTAAACAAGC AATGACGTTT AAATTTTCAC STATGGAAT 102 0 

TTCAAATGTT TCTTCATAAG GCTTAGAATC ACTTGTATCT TGTCGTTTAA TAATTAATTT 1080 

TACTGTTTTT TGTTTCGGTT TAGATTGTGT TTCATGTTGT GGAGTGTTTT TCACTGATTG 114 0 

TTCAGTCATT ATTTTTTACC CCCTTTAGAC TTACTTGTGT AATCACGTTT ACGAGGTGGT 1200 

ATTAAACTCA CATCGACGTC ATCATAAGTA AACTGCGGTT TTTCAAATGC GCCTTGGAAT 12 60 

GAGGCCATTG TCGTTTTTAA CCACTCTTCA TCATTACGCT CTGGGAATTC TGGTTTATAA 132 0 

TGGGCACCGC GTGATTCGTT ACGGTTATAT GCACCAATCG TAATAACACG TGCAAGTACT 138 0 

AACATGTTCC ATAGTTGACG GGTAAAGAAT ACCGCTTGGT TACTCCAAGT TTGAGTATCT 14 4 0 
TCCATATCAA TATCTTCATA ACGTTTCATC AATTCAACAA TCTTTTTATC TGTTTCTAAC 1500

AGTTTTTCAT TTTCACGAAC AACAGTTACA TTTGCTGTCA TAATTTCACC AAGTTCACGG 1560 

TGTAATTTAT ATGCATTTTC TGTACCGCGC ATAGCTAATA ATTTATCAAA ACGTTCTTG 1620 

TCTTCAGCTT TACGCTTTTC AAAAATACTT TCGTCCATAT CAGTATATGA TCGATCAATA 168 0 

TTTGAAATAT AATCAATCGC GTTTGGACCT GCTACTGTAC CACCATAAAT CGCTGATAAC 174 0 

AATGAATTGG CACCTAAGCG GTTACCACCA TGTTGAGAGA AGTCACATTC TCCAGCTGCA 1800 

AATAACCCTT TAATATTTGT CATTTGATCA TAATCTACAT ATAGACCACC CATTGAATAG 1860

TGAACAGCTG GGAAAATCTT CATTGGTACT TTGCGTGGGT CATCACCAGT GAATTTTTCA 1920 

TAAATCTCAA TGATACCACC TAGTTTTACA TCTAACTCAT GTGGATCTTT ATGTGACAAA 1980

TCAAGATATA CCATGTTTTC GCCATTTATA CCTAATTTTT GGTTAATACA TACATCGAAA 2040

ATTTCACGCG TTGCGATATC ACGAGGTACT AAGTTACCAT AATCAGGATA TTTCTCTTCT 2100 

AAGAAGTACC AAGGCTTACC ATCTTTATAT GTCCAAATTC GTCCACCTTC ACCACGTGCT 2160

GATTCACTCA TTAGTCGCAG TTTATCATCA CCAGGGATTG CAGTAGGATG AATTTGAATG 2220

AACTCACCAT TAGCATAAAT AGCGCCTTGT TGGTAAACAA TGGAAGCCGC TGATCCTGTA 228 0 

TTAATCATTG AGTTTGTTGT TTTACCGAAA ATAATACCAG GGCCACCCGT TGCCATAATA 2 34 0 

ACTGCATCTG AACCAAATGT TTCAATCTCA GCAGTTGTCA TATTTTGTGC AnCGATACCT 2 4 00 

CTTGCACTAT CATCGTCACC TTTAACTATG CCAAGGAATT CCCATCCTTC ATACTTCGTA 2 4 60 

ACTAATCCAT CTACTTCATA TGCACGAACT TGTTCATCCA ATGCATATAA TAATTGTTGT 2 52 0 

CCAGTTGTTG CCCCTGCATA TGCTGTTCTG TGATGTAATG TACCACCGAA ACGTCTAAAA 2580 

TCTAATAGAC CTTCATTTGT TCTATTGAAC ATTACGCCCA TACGGTCXA TAAATGAATA 2 64 0 

ATTTTAGGTG CTGCCTCTGT CATCGCTTTA ACAGGTGGTT GGTTTGCAAG GAAATCGCCA 2700 

CCATACACTG TATCATCAAA GTGAATCCAA GGAGAATCGC CTTCCCCTTT AGTATTGACC 2 7 60 

GCACCATTAA TGCCACCTTG GGCACAAACA GAGTGCGAAC GCTTTACTGG TACAACTGAG 2820

AACAAATCTA CATGTGCACC TTTTTCTGCC GCTTTAATTG TTGACATTAA GCCCGCTAGG 2 880 

CCACCTCCGA CAACAATAAG ATGTTTCTCT GCCATAAAAA TTTCACTCCC CTAAATTTTC 2 94 0 

AATCTATATT TGTTAAATGC GATGTATTAC ATAAAGGCAA TAATTGCAGT AACACCAATA 3000 

TACGAAATAA CTAAAAATAC GATTAATGAA ACCCATGTAA ATACTCGTTG TGATTTTGGA 3060

GATTGAAGTC CACCCCAAGT AACTAAGAAT GACCATAAGC CATTTGCAAA GTGGAACACA 3120

ACAGCAATAA TACAAATAAT ATAAAATATT GCCCATCCAG GATGTTGCAA TGTTTCGTGC 3180

ATTAAATCGT AATTCACTTC TTTGCCGTAA AATGCTTTTT GTAAACGTGT TTGCKTAAA 32 4 0 

TGGATACCAA TAAAGATAAA TGTTAAGATA CCACTCACTC TTTGGAAGAA GAACATCCAG 3300
TTTCTAAAAA TCGAGTAATG TCCAACATTT TCTTTTGCTG TAAATGCAAT GTGTATACCA 3360

AA.CAAACCGT GATATAACAA CGGAATGTAT ATAAATAAAA ATTCTACAAT AATTAGAAAT 34 2 0 

GGTAATGATT GCATAAAGTT AGATGCCTTA TTAAACGCTT CAGCACCTTG TGTTGCTTGG 34 8 0 

TGATTCACTA ATAAATGAAC GACCAAAAAT GCACCTATTG GGATAATACC TAATAACGAG 3540

TGAATACGTC TTAGATAAAA TTCATTTTTT GATTGAGCCA AAAGGAGTCC CCCCTGTGAA 3600 

CGAATATTTA ATTTATTGAG CTATTTATAT TAAACGACG CTTAACCCCC TAAAGTGATA 3 66 0 

TCGTTTCTAG CGTTGTCATT ATCATTAAGC GAAACATTTT AAAGACAAAT ACACACTGCA 37 2 0 

CGATCACCAA ACTGCATGTC GAACAATGTA ACATTTGGAT TCGATATTTA AAATTGCTTG 37 8 0 

TGATGATAAA CTTTCTCATT TAGAAAACGC TTCCACGTAC ATTCAAAAAA ATAACTTTGT 384 0 

TAACCATATT GTAACATTAT TTCATATATT TTGGGGCATG AGAATGATTC TCACGCCCAG 3900 

TAATTTATTT ATGCAATTGT TCATGTAGGT TCTTTGCGAC GTTTTCAGGA ATACCTATAT 3 9 60 

TTTTAAAATC TTCAAGTGTA GCTTCCTTCA TTTTCTTGAT TGAACCGAAT GAACGCAATA 4 020 

ATAATGTTTT ACGTTTGTTA CCGATACCAT CTATATCATC AAGTATTGAT TTCAAGCCTG 4080 

TCTTTTGACG TGTTTGTCTA TGAAATGTGA TTGCGAATCT GTGAACCTCA TCTTGGATAC 414 0 

GGTGCAACAA ATAAAATGCC TGGCTATTTT TCTTCAGTGG TACAATTTCT GCACTAGCGC 4200 

CATATAATAA TTCAGATGTT TGGTGTTTAT CATTTTTCTG CAAECTGCA ACAGGGATAT 4 2 60 

CAAGACCTAA TTCGTTTTGT AGCACATCAA TAACCCCGTT CATATGTCCT TTACCACCAT 4320

CTACTATTAT TAAATCAGGT AATGGTAATC CTTCGTTTAA AACGCGAGAA TATCGTCGTC 4 38 0 

TTACTACTTC TCTCATTGAT TTGTAATCAT CTGGACCTTT AACCGTTTTG ATTTTATACT 4 440 

TTCTATAATT TTTCTTATCT GGTTTACCGT CGACAAATGT AACCATTGCT GACACTGGAT 4 500 

CCACACCTTG AATATTAGAA TTATCGAATG CTTCAATTCT AATTGGTGTT TGAATTCCCA 4560 

TTTGTGTTCC AAGTTCTTCA ATAGCTTTAA TCGTTCTGGA CTCATCACGT GATATTAATT 4 620 

CAAATTTATT ATTTAAGGAT ACTTTA5CGT TATGTGCAGC TAGGTCAACC ATATCTTTTT 4 680 

TGGGACCTCG CGCGGGTTGA ACGATTTTAG TGTCCACAAC AGATTGAATC ATTTCTTTAT 4 74 0 

CCAAATTACG TGGTACATGA ACTTCCTTAG GTAAAATATG TTGGTTTAAG CTATAAAATT 4800 

GTCCAATAAA TGTATAAAAT TCTTCTTcTT CTGTTTGCTG TAATGGAATC SCGTTGTAT 4 8 60 

CTCGCTTTAT CATATTACCT TGTCGTATAA AGAAAACTTG GATACACATC CATCCTTTAT 4920

CAACACTATA ACCAAAGACA TCACGAATCG TTTTATCTGA TGACATAATT TTTTGTTTGT 4 98 0 

TTGTCAGATT TTGAATATGT TGAATTAAAT CTCTATATTC TTTAGCCCGT TCAAAATCAA 504 0 

GTGATTCACT TGCAGTTAAC ATTCGCTCTT CTAAACTTTT TAAAATTGTT TTGTCTTCCC 5100 
CATTCAGAAA ATCAGTAATT TCCTTCGTCA TTTGTGCGTA TTTACTCAAA TCAACGTCAT 5160

ATACACATGG TCCTAAACAT TGTCCAATAT GGTAATAAAG ACATAATTTA TCTGGCATCT 5 22 0 

TATCACATTT GCGATATGGA TATATTCTGT CTATAACTT TTTAGTTTCT TGAGCAGAAT 5280 

ATGCATTCGG ATACGGTCCG AAATATTTGC CAGTACCTTG TTTTACAGTT CTCGTCACTA 5 34 0 

GTAGTCTAGG ATATTTCTCC TTCGTAATTT TAATAAATGG ATAACTTTTA TCATCCTTTA 5 4 00 

ATAATATATT ATATCTTGGT TGATATTGTT TAATCAGATT CAATTCCAGT AAAAGTGAT 54 60 

CTGTTTCACT TGACGTGACA ATAAACTCAA AGCGACGTAT TTCACCAACC AGTCTCGTTG 552 0 

TTTTAGCATC ATGAGCACCC GTAAAATATG ATCGCAATC 5559 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

AAATCAATCG AGTGGCATGT CAAGGTCATA TCAATATTTT AGAATCTGCG ACTATGAGAG 60 

AGGAAATAAA TGAAATTGCG CGACGTATCA TCGTTGATAT TCGTGATAAG CAATTACGAT 12 0 

ATCAAGATAT TGCTATTTTA TATCGTGATG AATCTTATGC TTATTTATTT GATTCCATAT 180 

TACCGCTTTA TAATATTCCT TATAATATTG ATACAAAGCG TTCGATGACA CATCATCCGG 24 0 

TCATGGAAAT GATTCGTTCA TTGATTGAAG TTATT(&ATC TAATTGGCAA GTGAATCCAA 300 

TGCTACGCTT ATTGAAGACT GATGTGTTAA CGGCATCATA TCTAAAAAGT GCATACTTAG 360

TTGATTTACT TGAAAATTTT GTACTTGAAC GTGGTATATA CGGTAAACGT TGGTTAGATG 420 

ATGAGCTATT TAATGTCGAA CATTTTAGCA AAATGGGGCG TAAAGCGCAT AAACTGACCG 480 

AAGATGAACG TAACACATTT GAACAAGTCG TTAAGTTAAA GAAAGATGTC ATTGATAAAA 540

TTTTACATTT TGAAAAGCAA ATGTCACAAG CGGAAACTGT AAAAGATTTT GCAACTGCTT 600 

TTTATGAAAG TATGGAATAT TTCGAACTGC CAAATCAATT GATGACAGAG CGAGATGAAC 660 

TTGATTTAAA TGGTAATOAT GAAAAGGCGG AGGAAATTGA TCAAATATGG AATGGCTTAA 720 

TTCAAATCCT TGATGACTTA GTTCTAGTAT TTGGAGATGA ACCAATGTCG ATGGAACGTT 7 80 

TCTTAGAAGT ATTTGATATT GGTTTAGAAC AATTAGAATT TGTTATGATT CCGCAAACAT 84 0 

TGGACCAAGT AAGTATTGGT ACGATGGATT TGGCTAAAGT CGffiAATAAG CAACATGTTT 900

ACTTAGTAGG TATGAATGAT GGAACGATGC CACAACCAGT AmTGCGTCAA GCTTGATTAC 9 60 

AGATGAAGAA AAGAAATACT TTGAACAGCA GGCTAATGTC GAGTTAAGTC CAACATCAGA 1020






TATTTTACAG ATGGATGAAG CATTTGTTTG TTATGTTGCT ATGACTAGAG CTAAGGGAGA 1080

TGTTACATTT TCTTACAGTC TAATGGGATC AAGTGGTGAT GATAAGGAGA TCAGCCCATT 114 0 

TTTAAATCAA ATTCAATCAT TGTTCAACCA ATTGGAAATT ACTAACATTC CTCAATACCA 12 00 

TGAAGTTAAC CCATTGTCAC TAATGCAACA TGCTAAGCAA ACCAAAATTA CATTATTTGA 1260

AGCATTGCGT GCTTGGTTAT ATGA1SAAAT TGTGGCTGAT AGTTGGTTAG ATGCTTATCA 1320 

AGTAATTAGA GATAGCGATC ATTTAAATCA AGGTTTAGAT TATTTAATGT CAGCATTAAC 1380

GTTTGACAAT GAAACTGTAA AATTAGGTGA AACGTTGTCT AAAGATTTAT ATGGTAAGGA 1440

AATCAATGCC AGTGTATCCC GTTTTGAAGG TTATCAACAA TGCCCATTTA AACACTATGC 1500

GTCACATGGT CTGAAACTAA ATGAGCGAAC GAAGTATGAA CTTCAAAACT TTGATTTAGG 1560

TGATATTTTC CATTCTGTTT TAAAATATAT ATCTGAACGT ATTAATGGCG ATTTTAAACA 1620

ATTAGACCTG AAAAAAATAA GACAATTAAC GAATGAAGCA TTGGAAGAAA TTTTACCTAA 1680

AGTTCAGTTT AATTTATTAA ATTCTTCAGC TTACTATCGT TATTTATCAA GACGCATTGG 17 4 0 

CGCTATTGTA GAAACAACAC TAAGCGCATT AAAATATCAA GGCACGTATT CAAAGTTTAT 1800

GCCAAAACAT TTTGAGACAA GTTTTAGAAG GAAACCAAGA ACAAATGACG AATTAATTGC 1860

ACAAACATTA ACGACAACTC AAGGTATTCC AffTAATATT AGAGGGCAAA TTGACCGTAT 1920 

CGATACGTAT ACAAAGAATG ATACAAGTTT TGTTAATATC ATTGACTATA AATCCTCTGA 1980

AGGTAGTGCG ACACTTGATT TAACGAAAGT ATATTATGGT ATGCAAATGC AAATGATGAC 2040

ATACATGGAT ATCGTTTTAC AAAATAAACA ACGCCTTGGA TTAACAGATA TTGTGAAEA 2100

GGTGGaTTAT TATACTTCCA TGTACATGAA CCTAGAATTA AATTTAAATC ATGGTCTGAT 2160 

ATTGATGAAG ATAAACTAGA ACAAGATTTA ATTAAAAAGT TTAAGTTGAG TGGTTTAGTT 2220

AATGCAGACC AAACTGTTAT TGATGCATTG GATATTCGTT TAGAACCTAA ATTCACTTCA 2280 

GATATTGTAC CAGTTGGTTT GAATAAAGAT GGCTCTTTGA GTAAACGAGG CAGCCAAGTG 234 0 

GCAGATGAAG CAACGATTTA TAAATTCATC CAACATAACA AAG AG AAT T T TATAGAAACA 2 4 00 

GCTTCAAATA TTATGGATGG ACATAC TGAA GTTGCACCAT T AAAG T AC AA AC AAAAAT T G 2 4 60 

CCATGTGCTT TTTGTAGTTA TCAATCGGTA TGTCATGTAGATGGCATGAT TGATAGTAAG 2520 

CGATATCGAA C T G TAG AT G A AAC AAT AAA T CCAATTGAAG CAATTCAAAA TATTAACATT 2580 

AATGATGAAT TTGGGGGTGA GCAATAGATG ACAATTCCAG AGAAACCACA AGGCGTGATT 2 64 0 

TGGACTGACG CGCAATGGCA AAGTATTTAC GCAACTGGAC AAGATGTACT TGTTGCAGCC 2700

GCGGCAGGTT CAGGTAAAAC AGCTGTACTA GTTGAGCGTA TTATCCAAAA GATTTTACGT 2 7 60 

GATGGCATTG ATGTCGATCG ACTTTTAGTC GTAACGTTTA CAAACTTAAG CGCACGTGAA 2 820 
ATGAAGCATC GTGTAGACCA ACGTATTCAA GAGGCATCGA TTGCTGATCC TGCAAATGCA 2880

CACTTGAAAA ACCAACGCAT CAAAATTCAT CAAGCACAAA TATCTACACT CCATAGTTTT 2 94 0 
TGCTTGAAAT TAATTCAACA G CAT TAT GAT GTATTAAATA TTGACCCGAA CTTTAGAACA 3000 
AGCAGTGAAG CTGAAAATAT TTTATTATTA GAACAAACGA TAGATGAGGT CATAGAACAA 30 60 

CATTACGATA TCCTTGATCC TGCTTTTATT GAATTAACAG AGCAATtETC T T C AG AT AGA 312 0 
AGTGATGATC AGTTTCGAAT GATTATTAAA CAATTGTATT TCTTTAGCGT TGCAAATCCA 3180 
AATCCTACAA ATTGGTTGGA TCAATTGGTG ACACCATACG AAGAAGAAGC ACAACAAGCG 324 0 

CAACTTATTC AACTACTAAC AGACTTATCT AAAGTATTTA TCACAGCTGC TTATGATGCT 3300 

TTAAATAAGG CGTATGATTT GTTTAGTATG ATGGATAGCG TCGATAAACA TTTAGCTGTT 3360 
ATAGAAGATG AACGACGTTT AATGGGGCGT GTTTTAGAAG GTGGCTTTAT TGATATACCT 34 20 

TATTTAACTG GTCACGAATT TGGCGCGCGT TTGCCTAATG TAACAGCGAA AATTAAAGAA 34 8 0 

GCAAATGAAA TGATGGTCGA TGCCTTAGffi. GATGCTAAAC TTCAGTATAA AAAATATAAA 354 0 
TCATTAATTG ATAAAGTGAA GAGTGATTAC TTTTCAAGAG AAGCTGATGA TTTGAAAGCT 3600 
GATATGCAAC AATTGGCGCC AC GAG T AAAG TACCTTGCGC GTATTGTGAA AGATGTTATG 3660 

TCAGAATTCA ATCGAAAAAA GCGTAGCAAA AATATTTTGG ATTTTTCTGA TTAffAACAT 37 2 0 
TTTGCATTAC AAATTTTAAC TAATGAGGAT GGTTCGCCTT CAGAAATTGC CGAATCATAC 37 8 0 

CGTCAACACT TCCAAGAAAT ATTGGTCGAT GAGTATCAAG ATACGAACCG AGTTCAAGAG 38 4 0 

AAAATACTAT CTTGCATCAA AACGGGTGAT GAACATAATG GTAATTTATT TATGGTTGGA 3900 

GATGTTAAGC AATCCATTTA T AAAT T TAG A CAAGCTGATC CAAGTTTATT TATTGAAAAG 3 9 60 
TATCAACGCT TTACTATAGA TGGAGATGGC ACTGGACGTC GAATTGATTT GTCGCAAAAC 4 020 

TTCCGTTCTC GAAAAGAAGT ACTGTCAACG ACTAACTATA TATTCAAACA TATGATGGAT 4 080 

GAACAAGTCG GTGAAGTAAA AT AT GAT GAA GCGGCKAGT TGTATTATGG TGCACCATAT 4 14 0 
GATGAATCGG ACCATCCaGT AAACTTAAAA GTCCTTGTTG AAGCGGATCA AGAAC AT AG T 4 200 

GATTTAACTG GTAGTGAACA AGAAGCGCAT TTTATAGTAG AACAAGTTAA AGATATCTTA 4 260 

G AAC AT C AAA AAGTTTATGA TATGAAAACA GGAAGC T AT A GAAGTGCGAC ATACAAGGAT 4 320 
ATCGTTATTC TAGAACGCAG CTTTGGACAA GCTCGCAATT TACAACAAGC CTTTAAAAAT 4 380 

GAAGATATTC CATTCCATGT GAATAGTCGT GAAGGTTACT TTGAACAAAC AGAAGTCCGC 44 40 

TTAGTATTAT CATTTTTAAG AGCGATAGAT AATCCATTAC AAGATATTTA TTTAGTTGGG 4 500 

TTAATGCGCT CCGTTATATA TCAGTTCAAA GAAGACGAAT TAGCTCAAAT TAGAATATTG 4 560 

AGTCCAAATG ATGACTACTT CTATCAATCG ATTG 4 594 
(2) INFORMATION FOR SEQ ID NO: 201: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

GGTTTTCnTG GAAAGATAGT GAAAATCTCG TGTTTTTTGG TTTTgAGGTG TTGTTTGTAT 60 

TTTaTAAAAT GGCTTACATA TATGAAGCGT TGATTAAGA TGGAATTGTT AATTAATTGA 120 

ACCTATTTAG CTTTAAGAAG GCATAACAAG ATGACCTTAT TTTATGCTAT AATATTTCTA 18 0 

TTATGCGAAG ATTAAGGTGA GTAGTAAATT GGATAAAAAA GTAAGTATTC AAACAAAGCA 24 0 

AGTGTTGAAA CAGCACAACG AAAAAGAAAA ATTTGAATTT ACTACTGAAG GAACTTGGCA 300 

ACAAAGGCAA TCTAACTTTA TTCGGTATGT AGAACAAATT GAGGATGCAA CAGTTAATGT 360

TACAATAAAA GTGGATGATG ATAGCGTTAA GTTGATTCGT AAAGGCGACA TTAATATGAA 4 20 

TTTGCATTTT GTTGAAGGAC AAACGACAAC AACTTTTTAC GATATATCGG CTGGACGAAT 4 80 

TCCACTAGAA GTTAAAACAT TACGCATTTT ACATTTCGTA AGTGGAGACG GTGGCAAGCT 54 0 

AAAG AT T CAT TATGAATTAT ATCAAGATAA TGAAAAAATG GGTTCTTATC AATATGAAAT 60 0 

TAACTATAAG GAGATAGGCG AATGAATATT ATTGATCAAG TGAAACAAAC AT TAG T AG AA 6 60 

GAAATTGCAG CAAGTATTAA CAAAGCAGGA TTAGCAGATG AGATTCTGA TATTAAAATT 720 

GAAGTTCCTA AAGATACAAA AAATGGAGAT TATGCTACTA ATATTGCGAT GGTACTGACT 7 80 

AAGATTGCAA AGCGTAATCC TCGTGAAATT GCTCAAGCGA TTGTTGATAA CTTAGATACT 84 0 

GAAAAAGCAC AT G T AAAAC A AATTGACATT GCTGGTCCAG GATTCATTAA TTTTTACTTA 900 

GATAATCAGT ATTTAACAGC AATTATTCCT GAAGCAATTG AAAAAGGTGA TCAATTTGGA 960 

CATGTAAATG AAT CAAAAGG TCAAAATGTA TTGCTTGAGT ATGTTTCAGC TAACCCTACA 1020 

GGAGATTTAC ATATTGGTCA TGCTAGAAAT GCAGCAGTTG GTGATGCTTT AgcTAAtATT 1080 

TTAACTGCAG CTGGCTATAA TGTAACACET G AAT AT TATA TTAATGATGC TGGTAATCAA 114 0 

ATTACTAACT TAGCGCGTTC GATTGAAACA CGTTTCTTTG AAGCTTTAGG TGACAATAGT 12 00 

TATTCAATGC CAGAAGATGG CTATAATGGA AAAGATATTA TTGAAATAGG TAAAGATTTA 12 60 

GCAGAGAAAC ACCCTGAAAT TAAAGATTAT TCTGAAGAAG CACGTTTGAA AGATTTAGA 1320 

AAATTAGGCG TAG AAT AC G A AATGGCTAAA TTGAAAAATG ATTTAGCAGA GTTCAATACG 1380 

CATTTTGATA ATTGGTTTAG TGAAaCATCT TTATATGAAA AAGGAGAAAT TCTTGAAGTT 14 4 0 

TTAGCAAAAA T GAAAG AAT T AGGTTATACG TATGAAGCTG ATGGCGCTAC ATGGTTACGT 1500 
ACAACTGATT TTAAAGACGA CAAAGACAGA GTATTAATTA AAAATGACGG TACATATACG 1560

TATTTCTTAC CAGATATTGC GTACCACTTC GATAAAGTAA AACGTGGTAA TGACATTTTA 1620 

AT C GAT T T AT TTGGTGCTGA TCATCATGGT TATATTAATC GTTTGAAAGC ATCTCTTGAA 168 0 

ACGTTTGGTG TAGATAGTAA TCGTTTAGAA ATTCffiATCA TGCAAATGGT TCGTTTAATG 17 4 0 

GAAAATGGTA AAGAAGTGAA GAT GAG T AAA CGTACTGGTA ATGCGATTAC ATTAAGAGAA 18 00 

ATTATGGACG AaGTTGGCGT TGACGCTGCA CGTTATTTCT TAACTATGCG TAgTCCTGAT 18 60 

AGTCACTTTG ATTTTGATAT GGAATTAGCG AAAGAGCAAT CTCAAGACAA TCCAGTTTAC 192 0 

TATGCTCAAT ATGCACATGC GCGTATTTGT TCAATTTTAA AACAAGCGAA AGAGCAAGGT 1980 

ATTGAAGTGA CTGCTGCGAA T GAT T T T AC A ACGATTACTA ATGAAAAAGC GATTGAATTG 2040 

TTGAAAAAAG TAGCTGATTT CGAACCTACA ATTGAAAGTG CTGCTGAGCA TAGATCGGCA 2100 

CATAGAATTA CTAATTATAT TCAAGATTTA GCTTCTCATT TCCATAAATT CTATAATGCT 2160 

GAAAAAGTGT T AAC AG AT G A TATTGAAAAA ACAAAAGCAC ATGTTGCTAT GATTGAAGCG 2220 

GTCAGAATTA CAT T G AAAAA TGCATTGGCA ATGGTCGGTG TAAGCGCACC TGAATCAATG 2280 

T AAGAAC AT T TATATACACT CCAACGTAGA GTTTCTCGAA A&TACTTTG TGTTGGAGTG 23 4 0 

TTTTTTTTAG GTATGTGACA TATTGGGGAA TGCTTAGTAT GTGAATAAGG TTAAGAGGAA 2400 

CACAGTTGGA TGCTCTGCAC AACTGCATAA GAGAGCCTGA GACATAAATC AATGTTCTAT 2 4 60 

GCTCTACAAA GTTATAATGG CAGTAGTTGA CTGAACGAAA ATTCGCTTGT AACAAGCTTT 2520 

TTTCAATTCT AGTCAACCTT GCCGGCGGGG CCCCAACAAA GAGAAATTGG ATTCCCAATT 2580 

TCTACAGACA ATGCAAGTTG GGGTGGGACG ACGAAATAAA TTTTACGATA ATATCATTTC 2 64 0 

TGTCCCACTC CCTCTAAAAT GGAGGGTGTA AATGTTAGGA AC T GAT GAAT TATATAAAGT 27 00 

TTTATATGAA CATCTCGGAC CACSVTTTTG GTGGCCTGCT GATAATGACA TTGAAATGAT 2 7 60 

GTTAGGTGCA ATTTTAGTTC AAAAT AC TAG ATGGCGAAAT GCAGAAATTG C AT T GAAT C A 2820 

GATTAAAGAA CATACGCATT TTAATCCAAA TCATATATTA G AAC T AC C T A TTGAAACGTT 28 8 0 

ACAATCATTG ATACATTCAA GTGGCTTTTA TAAAAGTAAA T C AC T G AC GAT T AAAAC AT T 2 94 0 

ATTAACATGG TTAGCACGAC AT CAT T T C AA TTATCAAGAG ATTAATGAGC GATATAAAGG 3000 

TGGAT TAAGA AAAGAATTAT TATCTTTGAA AGGTATTGGA AGTGAAACAG CAGATGTCTT 3060 

ACTTGTTTAT ATATTCGGAC GTATTGAATT TATTCCAGAT AGCTATACAA GAAAAATATA 3120 

TGATAAATTA GGATATGAAA ACACTAAAAA TTATGATCAA TTAAAAAAAG TAGTCaCATT 3180 

ACCAAATCAT TTTACAAATC AAGATGCTAA TGAATTTCAT GCTCTGTTAG ATGTATTTGG 32 4 0 

TAAACATTAC TTTAGAGACA AAGATATAAA GAAT TAT GAT TTTTTAGAAC CTTACTTTAA 3300 

AAAGTAAACG CTGTGAAGTT AGATAGATGA GTTTATATGA AAT AT AAAAA ATAATTTACT 33 60 
ATTTTCTTTT AGTATGTGGA CTTATATAAT AAATAGAAGC ATATAAAGAA AAAAACAGTT 3420

GTTTGTTTGT GCAGCAACTG CATAAGAGCC CCTAATCGCT AAAGCTCAAG GGGAGTAAAG 34 80 

GAATACAGTT GTTTGTGCAG CAACTGCATA AAAGCCTCTA ATCACTAAAG GTGAAQ&GA 35 4 0 

ACGCAGTTGG ATGCTAAGGC ACAACTGCAT AAAAGCCTCT AATCGCTAAA GATGAAGAGG 3600 

AACGCAGTTG GATgCTACCG CACAACTGCA TAAATCCCTC TaATCgcTAA AG C G AAAAG T 3 6 60 

GGGATTAAAA AGGAGATGTG ATAGTGTGAA GAAATCGTTA ATTGCTTTTA TTTTGATTTT 3720 

TATGCTTGTC CTGAGTGGCT GTGGTATGAA AGATAATGAT AAACAAGGTA GCAAT GAT AA 37 8 0 

TGGCTCGTCT AAATCGCCGT AC C AT AGAAT TGTTTCGTTA ATGCCTAGTA ATACTGAAAT 38 4 0 

T T TAT AT G AA TTAGGATTAG GTAAATACAT AGTTGGTGTT TCAACGGTTG ATGATTATCC 3900 

AAAAG AT G T G AAAAAGGGTA AGAAACAATT TGATGCTT'E AATCTAAATA AAGAGGAACT 3 960 

TTTAAAGGCA AAGCCAGATC TAATTCTTGC GCATGAGTCG C AAAAG G C AA CTGCTAATAA 4 020 

AGTATTGTCA T CAT TAG AG A AACAAGGCAT CAAAGTAGTG TATGTTAAAG ATGCACAATC 4080 

AATTGATGAA AC T T AC AAC A CATTTAAGCA AATTGGGAAA TTAACGCATC ATGATAAGCA 414 0 

GGCTGAACAA CTTGTTGAGG AAACTAAAGA TAATATCGAT AAAGTCATAG ATTCAATTCC 4 2 00 

TGCTCATCAT AAAAAATCAA AAGTATTTAT TGAGGTTTCA TCAAAGCCTG AAATATATAC 4 2 60 

AGCAGGGAAG CATACATTTT TTAATGATAT GTTAGAAAAA TTAGAAGCCC AAAATGTGTA 4 320 

TAGTGACATT AATGGTTGGA ACCCTGTAAC GAAGGAAAGT ATTATTAAAA AGAACCCAGA 4380 

TATATTAATT TCGACGGAAG CTAAGACAAG ATCAGATTAT ATGGATATCA TCAAAAAAAG 4 4 40 

AGGTGGATTC AATAAAATTA ATGCTGTCAA GAATACACGT ATTGAAGTTG TAAATGGTGA 4 500 

TGAAGTATCA AGACCAGGTC CACGTATTGA TGAAGGATTA AAAGA2KTAA GAGATGCAAT 4 5 60 

T T AT AG AAAA TAAACCATTC TAATTATGCC CCTTATTGCT ACATGTAAAA AATACATGTT 4 620 

TGAGATAAGG GGTTTTTaAA ATATATTTAG TGAATGATAG CAACGCGAGT ATGTGATTGC 4 680 

TATAATGAAT GTAATTATCG ATGAAcaaAA GAGAATGCTA TGACATTTAA TAAAGTATTA 4740 

TTGAGCTGGa TAGTCmTATT GATTATAACA ACTAGCATAT ATCTATTTTG GCAGTTGGGC 4800 

GATATCAATG ATGTATTTAA CCAGTCTATT TTAATCAATG TTAGATTACC GAGATTATTA 4 8 60 

GAAGCATTGT TGACAGGTAT GATATTAACT GTTGCAGGCC TTATATTTCA AACAGTTTTA 4 920 

AATAATGCAT TGGCAGATAG CTTTACATfA GGATTGGCAA GCGGCGCTAC ATTTGGTTCA 4 980 

GGATTAGCAT TATTTTTAGG T T T AAC AAC G TTATGGATTC CTGTATTTTC AATAACATTT 5040 

AGTTTGATAA CATTAATAAC TGTATTAGTC ATTACGTCGG TATTGAGCCA AGGCTATCCA 5100 

GTTAGAATCT TAATATTAAG TGGTTTAATG ATTGGTGCGT TATTCAATTC ACTCTATAT 5160 
TTTTTGATTT TATTAAAACC TCGCAAATTA AATACAATTG CCAATTATCT GTTTGGTGGT 5220

TTTGGTGATG CAGAATACTC AAATGTATCT ATAATAGCAA TCACATTTAT CATTGCATTG 52 8 0 

TTTGGTATAT TTATCATTCT TAATCAACTA AAGTTATTGC AAT TAG GAGA ACTAAAAAGT 534 0 

CAGTCACTAG GCTTAAATGT TCAATTGATT ACATATATCG CGTTATGTAT AGCTTCTATG 54 00 

ATAACGGCGA TAAATGTCGC ATATGTTGGC ATCATTGGAT TCATTGGTAT GGTGATACCG 54 60 

cAACTCATTA GAAAATGGCA GTGGAAACAA TCATTAGGAA GACAATTGGC TTTAAATATT 5520 

GTAACTGGAG GACAAATAAT GGTTATGGCA GATTTATTG GTAGCCATAT ATTGTCACCA 5580 

GTACAAATAC CGGCAAGTAT TATCATTGCA TTAATTGGTA TACCAGTGTT AtTTTACaTG 5640 

CkAAwAtCtC aGTCgAAAcG GTTACaCTAG CACACGACaT TTGCTAAAAT AAAAATAACT 5700 

ATAAACATAA AGAGGGCATA AGCGATGGAT T T G AAT C AAA TTAAAGCAGT TGTATTTGAT 57 60 

TTAGAAGGTA CGTTGTTGGA C AG AG T T AAA T C T C GAG AG A AATTTATCGA AGAGCAATAT 5820 

GAACGATTTC ATGACTACTT AATTCATGTT CAACTGGCAG ATTTTAAAAA AgCATTTATT 5880 

GAGCTAGATG ACGAT GAAGA T AAT GAT AAA CCTGATTTAT ATAAAGAAAT CATTAAACGT 5 94 0 

TTCCATGTAG ATAGGTTAAC TTGGAAAGAC T TAT T T AAT G ATTTTGAAAT GCATTTTTAT 6000 

CGTTATGTAT TTCCTTATTA CGATACTTTG TATACACTAG AAAAgCTATC GCAAAAAGGC 60 60 

TTTCAAATTG GTGTTATCGC AAATGGTAAA TCTAAGATTA AACAATTTCG ATTACATTCA 6120 

CTTGGTTTGA TGCATGTTAT TAATTATTTA TCAACATCAG AACAGTTGG TTTTCGTAAA 6180 

CCACATCCTA AAATTTTTGA AGATATGATT GATCAACTAG GGGTATTACC TGAGCAAATT 62 4 0 

ATGTATGTTG GCGATGATGC GTTAAATGAT GTAGCTCCAG CACGAGCTAT GGGCATGGTT 6300 

AGTGTATGGT ATA 6313 
(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 

CCGTAAACAC ATCAACAAAA GAAGGCTATA T T AC AAAAG A AGACTTGGAC TTATGCTGCA 60 

CGTCGCTCTA ATTCAGCTGG AATGCAAGTC ACCGGACGAC TGGCTTACAT TGAACCTTAT 12 0 

GGGGCAACAA GTCGCACAAA ATAAACGCGC GAGAAGCaAG AATAGGAAGT GATATCTATG 180 

AAATGGTTAT CACGAATATT AACAGTAATA GTGACCATGT CtATGGcGTG TGGTGCATTG 240 

ATATT TAATC GTAGACATCA GCTAAAGGCG AAAACGCTGA ACTTCAATCA TAAAGCATTA 300 
ACAATTATTA TTCCGGCTAG AAACGAAGAA AAAAGAATAG GTCATTTACT ACATTCGATA 360

ATACAACAGC AAGTTCCAGT AGATGTCATT GTTATGAATG ACGGffiCGAC AGATGAAACA 4 20 

GCACGTGTAG CACGTTCATA TGGTGCGACT GTAGTAGATG TTGTTGATGA TACTGACGGC 4 80 

AAATGGTATG GGAAATCACA TGCTTGTTAT CAAGGTGTGA CGCATGCATG TACGAATCGC 54 0 

ATTGCCTTTG TAGATGCTGA TGTAACTTTC TTAAGGAAAG ATGCTGTTGA AACGTTGATT 600 

AATCAGTATC AATTACAAGG TGAAAAAGGA TTGTTAAGCG TACAGCCTTA TCATATAACA 660 

AAGCGTTTCT ACGAAGGGTT TTCAGCGATA TTTAATTTAA TGACAGTCGT TGGTATGAAT 720 

GTATTTTCTA CCTTAGACGA CGGTCGGACT AACCAGCATG CATTTGGACC GGTGACATTA 780 

ACAAATAAAG AAGAT TAT T A TGCAACTSGA GGTCATAAAA GTGCAAACCG TCATATTATT 84 0 

GAAGGATTTG CTTTAGGAAG TGCATATACT TCACAATCAT TGCCCGTAAC AGTTTATGAA 900 

GGGTTTCCAT TTGTTGCATT TCGCATGTAT CAAGAAGGAT TTCAGTCATT ACAAGAAGGA 960 

TGGACAAAGC ATTTGTCAAC TGGGGCAGGT GGCACAAAGC CTAAGATCAT GSAGCAATT 102 0 

GTGTTGTGGT TGTTTGGTTC TATAGCGAGT ATTTTAGGGC TATGTCTTAG TTTAAAATAT 1080 

CGCCAAATGT CTGTAAGAAA AATGGTAGCA CTTTACTTGA GCTATACTAC ACAATTTATT 1140 

TATCTGCATC GAAGGGTCGG CCAATTTTCT AATTTATTAA TGGTATGTCA TCCATTGTTA 1200 

TTTATGTTTT TTACTAAAAT TTTCATCCAA TCTTGGAAAC AAACGCATCG TTATGGTGTA 12 60 

GTTGAATGGA AAGGTCGTCA ATATTCTATA TCTAAAGAAC AATAAATCAA GGTAATGGCA 1320 

TTTCAATATA GGAGGACTAG TATGACAATG ATGGATATGA ATTTTAAATA TTGTCATAAA 1380 

AT CAT G AAG A AAC AT T C AAA AAGCTTTTCT TACETTTTG ACTTGTTACC AG AAGAT C AA 14 4 0 

AGAAAAGCGG TTTGGGCAAT TTATGCTGTG TGTCGTAAAA TTGATGACAG TATAGATGTT 150 0 

TATGGCGATA TTCAATTTTT AAATCAAATA AAAGAAGATA TACAATCTAT T GAAAAAT AC 15 60 

CCATATGAAC AT CAT C AC T T TCAAAGTGAT CGTAGAATCA TGATGGCGCT TCAgCATGTT 162 0 

GCACAACATA AAAATATCGC CTTTCAATCT TTTTATAATC TCATTGATAC TGTATATAAA 168 0 

GATCAACATT TTACAATGTT TGAAACGGAC GCTGAATTAT TCGGATATTG TTATGGTGTT 17 4 0 

GCTGGTACAg TAGGTGAAGT ATTGACGCCG ATTTTAAGTG AT CAT G AAAC ACATCAGACA 18 00 

TACGATGTCG C AAG AAG AC T TGGTGAATCG TTGCAATTGA TTAATATATT AAGAGATGTC 18 60 

GGTGAAGATT TTGACAATGA ACGGATATAT TTTAGTAAGC AAC GAT T AAA GCAATATGAA 1920 

GTTGATATTG CTGAAGTGTA CCAAAATGGT GTTAATAATC ATTATATTGA CTTATGGGAA 1980 

TAT TAT G C AG CTATCGCAGA AAAAGATTTT C AAG AT G T T A EG AT C AAA T CAAAGTATTT 204 0 

AGTATTGAAG CACAACCAAT CATAGAATTA GCAGCACGTA TATATATTGA AATACTGGAC 2100 
GAaGTGAGaC AGGCTAACTA TACATTACAT GAACGTGTTT TTGTGGaTAA GAGGAAAAAG 2160



GCAAAGTTGT TTCA 2174

(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:

GAAnCAGnTA GACAAATTAT GGaAAmCGGT GTGAATCaAG GATTCtTTGG TGTAGCTGGT 60 

TTTGACCTAC TCGTCGATGA GGATGATAAC GTTTATGCGA TTGATTTAAA CTTTAGACAA 120 

AATGGTTCaA CGAGCATGTT ATTACTTGCT AACGAGTTGA ATTCAGGATA TCAAAAGTTT 180 

TAT AG T T AT C ATTCAAAAGj TGATAACACA CATTTCTTCA ATACGATTTT GAAATATGTC 240 

AAAGAAGGTA GTTTATACCC GTTATCTTAT TATGATGGTG ATTGGTACGG T G AAG AT AAA 300 

GTTAAATCAA GGTTTGGCTG TATTTGGCAT GGTGATTCAA AAG AAA C AG T ACTGGAGAAT 3 60 

GAACGCGCAT TTTTAGCTGA ACTTGAACAC TATTAGAGTT CGGACATAA GGCGCTACAA 4 20 

TGTTGTGTTG CCAGTAGTTG ACTGAATATG CGTTTGTAAC AAGCTTTTTT CGATTCTAGT 480 

CAACAGTAAT TAAATTTATG ATATGGCAAT ACTTTGTAAT ACTAATATTA AATGGCGACT 54 0 

TTTATTTCAC TATGTTATAA GAGTTGCCAT TTTGTTGATA AAGGTATACT AAAGGTTATC 60 0 

GTTTTGAAAT TTTTAGTAAC TAGATATGTT TCGTGTTATA GACCGAATTT GTGTATACGT 660 

AAAATTTAAT GCTATTGAAT TTTTAAAATG AAAAAC AT G A CAT T AAAT T G AATTCATAAT 720 

ATGTCTAATT GACTAACTTG TTGGAGTCAT TTACTATTTT ATGTATGACA TATTTTAAAA 780 

AGTGAGGGTC AAGCATGTCT TATAAAJCAT ATCCATTCTT TAGAGATATA TTAATAAATG 8 40 

AATGTATTTA TTTCGCCTCT AAAAATAAAA AACTAGTACG CCT AAAT TAT AAAAGT GAAG 900 

CGnATGTAGG CGTTTGGACA GAAGAAAGTG TGGCCGTATC ATTTTTAACA AGTCGTGATA 960 

TTCCATTTGA TAAAGTTGTA AAAATGGACG TTGATCGTTT TGCTACTTAT fiATTAGATG 1020 

AATTGTTTGA T G AAC AAG AC CATATTATTA TGAATCAAAC AATGGAAGAw GAAGGGCATC 1080 

TACTAAACGT TGTAGCTGTT ACACAAGAAG TGATGACGGA AT T AGAT AAA ATTAGAATCA 114 0 

AAGAATTTGT CCAAGATGTA GCGAAATATG AT GAAGT AT A CGGCTTAACT AAAAAAGGTA 12 00 

GTAAGCAGTT TATTCTCATT AGTGAAAATG ATAGCGACGA AAAAAAGCCG CATATTATGC 12 60 

CTGTATGGAG TATTAAAAAC AGAGCGTTAA AAGTTCGAGA TGAAGATTTT GAAGAGTGTG 1320 

ATTTAATTAC GATTGAAGGT TCTGTTTTCG GAGAATGGCT AGAT G AAC T T AGAGATGATC 1380 
ATAAAGCCGT TGCGATAGAT TTAAAAACTG GCGTGGTTGG TACAATTGTT TCAGCGCAAA 1440
AATTGTCTAA TGAACTTACA TTTTAATTGA ATTGAGTTGT AAGTAATTAA GTGATAATAA 1500 
ATGGAACAAT AC G T AT T C AA AACACTTAGA CCATAAAATA AAAGGCCATT TATATAGCGT 15 60 

TTATTTAAAA CAACGCGCAT ATAAATGGTC TTTTTCTATT TTTCTAAATA TAATGCAOE 162 0 
ATAGCACCTG nAAAATGCGC CGTTTTCAAC ATAGTACGGT TTGCAACCGC GTAACACAGT 168 0 

ATAATCTTCC ACAACTTTGC GTAATAAAGC GTTATTATGA AATGAAGAAC CGATATAAAC 17 4 0 

GATATTTTCA GTTTTAAATT CACGTGCAAC AGTAATGGCC ATTGTCGTAA CAACTTCGCC 18 00 

AACGACACCA ATAACGGCTG CTAATTTATT GCTAGGTGTA AAATCAGCAT CTAAATGATG 18 60 
TAGTACATGA CCAAAATTAG CTGCTGTTAA ATCACCGGGA ATGGGTGGTT CGGTATCTTT 192 0 

ATAAATATGT CTAACCTTTA AATCGATAGT GTTACGATCA CCGTGTTGTG CCATGTCAGT 198 0 

TAACTGTTTA TAATCAGTGA TTTGACTTAG TAAATAACCGAGTCCTTGAA TCATGCCTCC 2040 

ACCTGTACCG ATACCGCCTA CACGACGTTG TGATTGGCCG TCGAAATAAT GTAGTGACGT 2100 

ACCGGTACCA ACATTTGCAA AAATATAATC TGCTAAGTCA TGGCCTTGCT CTTTTAACAA 2160 

AATACCTAGT CCTTGAGATG CAGCATCAAA CTCTACAAAA ATTTGTGCAG GAATGTTGAT 2220

GTTTTCAGCA ATGACACCTG CATTACCTCC AGTTAAGCAT AATTTTTCAA TTTGCTGTTG 228 0 

GTTTAACCAT TCCACAACTT GATCAATATT TTTAGTTAAT TCAGTTTTAA AAGTACGTTG 2 34 0 

GTTATCTTGC TCTTGAACGA TTTTAATTAG TGTACCGCCA GCGTCAATGC CAACTTTCAT 24 0 0 

AAGATTCCCA CCTCATTATT AATGTCTATC CTTAAATAAT AGTATAGTAA AATGACTAAA 2 4 60 

AAACAAGTAA TAATAGTAAT TAT T AAC AAA TTTGATGCCa TTGCATTTCA ACATTGTAAG 2 52 0 

cGTATCGCAA TTAAtGTTTT ACAAACGTGG ACGTTAAGTt ATATATAT T A TTTTCTAGGA 258 0 

ATTTTGAAGT TGTATAGGAT TGTTAGTTAG TGACGCAATA TTAAAAGffiG TTCGTACGCA 2 64 0 

GTGTATTTGT AAGTCTCTGA TTAAAATGAT AAGTAATGAG GAATAGTACA TTAATTTTGA 27 00 

AATTTAAAAA ATATAAATAA GTAATTTATT T AAC T TAG AG CAAATAATGG TATCGTAGTG 27 60 

AAATAATAGG TAAAATAATA TGGGGATTCA TGCTTCATAT ATAAAAAGAT AGGGGTTAAA 282 0 

TATATGGCTA AAGAACTTTG TTTTGAAGGT ATCACTTTAA AAGCATTTGA TGAACAATAT 288 0 

CGTTCAGCAA TTAATGATTT TGACTTGAAT GAAAGACAAC AAATATATTC ATCTTTACCT 2 94 0 

AAAGAAGTTA TTGATGATGC AATTAATGAT GCTGATAGGA TTGCTAACGT AGCAwTAAmC 3000 

G AT AAAAAT G AAGTGGTGGG CTTTTTTGTA TTACATCGTT AC TAT C AGC A TGAAGGTTAT 3060 

GATACACCTG AAAATGTCGT TTATATTCGT TCATTATCGA TTAATGAAAA ATATCAAGGT 312 0 

TTTGGATATG GCACGAAAAT AATGATGTCA TTGCCGCAAT ATGTTCAAGG TGTATTTCCT 318 0 
GATTTTAATC ATCTATATCT AGTAGTAGAT GCGGAAAATG ACAATGCTTG GAACEATAC 3240

GAACGTGCTG GATTCATGCA TACAGCGACT AAAGAAGAAG GACCAATTGG CAAAGAACGA 3300 

C TAT AT T AC T TGGACTTAGA TTCAAAACAT GTTTCATCAT TAAAGCTTGA AGAAGAAAGT 3360 

CGTTCAGAAG TGACCAATGT ACATATCATT AATTTAATGA TTGATGGCCA AAAGGTTGGC 34 20 

TTTATCGCAT TGGAGCAGAT TGGTGAACGC ATGAACATTG CTGCTATTGA AGTGGATAAA 34 8 0 

TCATATCGCT TTAATGGTAT TGGTTCAAGT GCTCTGCGAC AAT TGCCAAC TTACTTAAGA 354 0 

AAAAACTATG ACAACCTTAA TGTGATTACG ATGATTCTGT T T G G AG AG AA TAATGATTTT 3600 

AAACCATTAT GTTTAAATAG TAATTTCGTT GAAATC(AAC AAACTGATGA TTATGTCGTT 3660 

TTCGAAAAAT ATTTAAATTA CTAACAGTGA TTGCGAAATA TGATATTGTC ATTTATAATT 37 20 

TAGTTTTGTT ACTATATATA AATGAATTCA GACGTATAAA TTTAGATTAT ATCCTTCGAA 3780 

AGGAAGTATT GGGCAATGAA AATTCAAGAT TATACAAAAC AAATGGTTGA T GAAAAAT C A 38 4 0 

TTTATTGATA TGGCTTATAC ATT AT TG AAT GATAAAGGCG AAACAATGAm mTTATATGAT 3900 

ATyATCGATG AATTTAGAGC GTTAGGTGAT TATGAGTACG AAGAAATTGA AAATCGTGTT 3960 

GTACAATTTT ACACGGATTT AAACACAGAT GGTCGTTTTT TAAATGTTGG AG AAAAT T T A 4 020 

TGGGGATTAC GTGATTGGTA TTCGGTAGAT GATATTGAAG AGAAAATCGC ACCAACTATT 4 080 

CAAAAATTCG ATATTCTGGA T GC AG AT GAT GAAGAAGATC AAAACTTAAA ATTATTGGGC 414 0 

G AAG AT G AAA TGGATGACGA CGATGATATT CCAGCTCAAA CAGATGATCA AGAAGAACTA 4 200 

AAT GAT CC AG AAGATGAGCA GGTTGAAGAA GAAATCAATC ATTGGATAT AGTCATTGAA 4 2 60 

G AAG AT G AAG ATGAACTAGA C G AAG AC G AA GAAGTGTTTG AAG AC GAAG A AGACTTCAAC 4 320 

GATTAATTTT TTGTTTGACT TTTAGTTGAA AG AT GAT AAA ATTTTATTCG GGCTCCTTTA 4 38 0 

AATAGGACAC GTGTATAAAA TTTATACGCT CCCCTTACAG AATTTGTGAG AGGGAGCGTT 444 0 

TTTTtATTTA ATTGAGTAAA TCAAGAAATG ATAACGCAAA AATCAAAGTT GTAAATGATA 4500 

T AC AT AG T GA CATAGCAGTA TGGAAACGGT AAG T AAAC AG AATTTAATTT TGTCGAtTCG 4 560 

ACAAtAAaCA aCTtGAaTGA GCTTGCTTTA ATGTTATGTn nTACGTAATT T T T AC AAT T G 4 62 0 

AT GAG GAAG C ATTCCCTTTA ATAATTAGGA G G T C AAG AC A T G AC AAAAT T TATTTTTGTA 4 68 0 

ACAGGTGGCG TAGTTTCATC CATTAGGGGA AGGGT 4 715 
(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:



ATAATAACTG AAATTAAAAT TGCTAAATmG TGTTaAgCTA TCGCmACAAT GAAAATwCCG 60 

ATTTTGCGTT GTTGAAAATA TCTTTCCAAA CCAAGAATCG ATAATGGEA TAAATATAAT 120 

AAATTTCCAT AAAATGACCA AGTAAAATTA AAGTATATAA CGACAGTTGA CATGCCGTAT 18 0 

AAAATCGTAG CGATCATATT TGCTGAGCGT TTAAAGTGTA ATATTTTAAA TAAGTAGAAG 24 0 

GTCACGACAA ATGTTATGAT AGCTCGTATC ATAGCCATAA TAAGTTGGTT TGTCGGCCAA 300 

AAATGTATTG TCGTCGGATT AAATATACCA ACCGTTTCTC CTATTTTAAT GAAkAGAAAA 3 60 

TTTAGCCACA TTAAAGGTGA CAGCGAATAA TAATnTGATA GTCCTTTCAT ATAATCGCCA 420 

CCTAmTCCAA ACGATGCATC ATrTAAACTA GAAnAACTAC GTAGATGTTC ATACAnATAC 4 80 

ATTTGAAATG GCATCATTTG ACGGAATCCA TCTCCAGCCC CGCTAAAAAC AGTACCATTC 54 0 

ACAATATAAT CATAGATATG AG T AG AAAAT AAAATAAGCG TTAATATTAC AC T AAT G AAA 600 

GTTATAACAA AGAATTGTTT GACGTTTGAA TTTAGCCACT TTTTTAACAC AACATTATCC 6 60 

TCAACTTTCA AAT T T AAAAT TAAGTTTAAC TGAAACTAAA GTTAATGAGG TTCTCIATAG 72 0 

GTAAAGACGA AGATGACTGT GGAACAGATA CCTTATCATA GTTACTTAAA CTTTGGATCA 780 

TTTTCAGTTT AT CAT T AAAC AAATATATTG AATAATAAAa aTGTCATACT GATAAAGATG 84 0 

AATGTCACTT AATAAGTAAC TTAGaTTTAA CAAATGATGA TTTTTAATTG TAGAAAACTT 900 

GAAATAATCA CkTATACC 918 
(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

TCGCCCnATA ATCAATTTAT TTTTCATGTG CCACTCCTAT ACAAGCTnAC AATGCTTCTT 60 

CAGTTAAGGC AATATCTTTT AATTTTGTTT GATATTTTTG TTCAAAGTCA TATTGTAACT 120 

GAACAATTTC TGGCAAACCA- ATATGCCAAT (CGCCAATTT TTTTTTAyCT TtGAAGAGCT 180 

CTTTTGGTGA TGkTTGcGAC ACT AT ACT AC CTTCTTTCAT AACGATGACT TCATCTGCAT 24 0 

AACGCGCGAC TTCATTCATA TCATGTGAAA TTAGGATAAT TGCCTTATTT TCATCTGTTT 300 

GTAGTGACTT TAGTAATCTC ATTACTTGTC GTTTACTTTG TGGATCAAGT CCTGCTBAG 360 

GTTCATCAAC CACGATAATA TCAGGATTCA TTGCCAATAT CGATACAATC GCTATTTTAC 4 20 
GCATTTGTCC ACCTGACATT TGAAATGGTG ATTGTGACAT TACATCTCTT GAAAAGCCTA 480

AATCCATCAA CAGACGATGG GCATAGTTTT TGGCTTCATC TAAATTCATT TTAAAGTTTT 540 

TAGGTCCAAA TATCATTTCA CGCTCTACTG TGTCCTCAAA TAATTGAGAT TCGGGAAATT 600 

G AAAT AC CAT TCCAATTCTT TTTCTTACAG GTCTAATATA TTTATCTTTG GTCTTATGTG 6 60 

TAATAGTAAT GTCATCAACT GTAACTGTCC CAGTAGTCGG CTTTAACAGC GCATTAATAT 7 20 

TTTGTATCAA CGTTGATTTA CCACTACCCG TTTGTCCAS GATGGCGTAA TATTTACCTT 780 

GTTCAAATTC TGTATTAACA T CAT G AAT AG CTTGATGCTG ATATGGTGTC CCTTTTTGAT 840 

AGGTATAACT TACATTGTCA AACCGTATAG TCATAGTTGA TCCACCAGCC CTTCATAAGT 900 

TAAGAATGAT GTTTGGTGTC CCAGCATTTG ATTTATTTTG ATTGGGAATG GCAAATCTAG 9 60 

ACCTATTCTT GTTAACTCTT CTGCATTGTC GAAAATTTCA GTCGCTGTGC CTTCTTTATA 1020 

GACAGTCCCT TTATTCATAA C GAT AAC AT G ATCTGCTTCC ATCGCCTCAG AT AAAT CAT G 1080 

CGTAATAGAA AT GAT T G T AA TATTATGTTC T GAT T T AAC T TTTCTCACTA AATCCAATAA 114 0 

ATTTTGACGT GCATCAGGAT CTAACATAGA AGTCGCCTCA TCTAATATAA TGACAGAGGG 1200 

GTTAAGTGCT AATACACTTG CTATAGCCAC ACGCTGCTTC TGTCCCCCCG ATAATGCATT 12 60 

AGGTTCATAA TCTGCACGTT C T AAC AT AT C AACTTGTTTA AGTGCTTCGC TGACTCTTCT 1320 

ATGCATTTCG TCATATGGAA CCGCATGATT TTCGAGTCCA AATGCfiCAT CGTATTTTAC 1380 

AAT TGAACC A AC AAAT TG AT TATCCGGATT C T G AAAT AC A ATTCCTATGT CTTTTCTTAA 14 40 

CTTTTCAAAA TTATCATCAG TTATAGCTTG ATTATTATAA AAAATTTCTC CAGATTTAAC 15 00 

TTTCTCTATG CCAATCATTA ACTTGGcAAT TGTAGATTTT CCAGAACCGT TATGACCAAC 15 60 

AATAGATGTC CACTGACCTT TAGGTATATT AAAAGAAACA TCTTTCAATG TGAAGGATGC 162 0 

ATCACTTTGA TATTGAAATG AAACATTTTT AAATACAATA ACTGAATTCT TATCCTCCAC 1680 

TTGTCTCTCT CCTTTACGAT TCGTGTATCT ATCATATTTT ACAATATTTA TAAATCGCTG 17 4 0 

TATATGACAT TGACTGGGTT CTCTATA'RT TACTAGTATT TTCTGACTCA TTTCTAGTCT 1800 

TTAAAGTGTT GTTTAACAAC TAATGATAAG GACTTTTATT CCTCTCTAAC AATTATGTAT 18 60 

AAACGTTAAT AAAATAAATG ATT TACT AAT ATAGGGGTGG TCGCGTTTGA T T C AAC GAT A 192 0 

ATACTTTCAC TTCATTCAGT TCTAGTGAAA TTGATCAAAC TAGCTTCATC ATATTTTAG 198 0 

ATTCGCACTC AAAAAAGTAA ATATAAAGAA ATCGGACTTA AAAACATTTC TGTTCATAAG 2040 

TCCGATATTT TATTCAATAA AAAAGCGCGC ACCCCATCAT AAGTTTGTTG AGTTCACGCT 2100 

TTAAATCTTT ATTTAGTTGA TGGGGTACTC TGAGCTAGAC AATATTTGTA TGTGGCAAAC 2160 

ATTAtCGTTG CACTCATTTG CTTTATATAA AAGTAGTTAG TGTATTTATA TAAATTCTTA 2220 

TACTAATTCG ATAATTACTG ATTCAGCACC GTCACCGCGA CGAGGGCCTT GTTTAAGGAT 22 8 0 
ACGAGTGTAA CCACCTTGAC GTTCTGTGTA AcGCTCTGCG ATTTCACCAA ATAATTTTTG 2340

AAGTGCAGTT TGTGTAGTTT CATCTTCGTT TAAGATTCA ACATTACGTA AAGTTTTAGC 2 4 00 

TGCATTACGA CGAGAAGCTA AATCTCCTTT TTTACCTAAA GTGATTAATT TCTCAACAAC 24 60 

ACTGCGAACT TCTTTTGCAC GAGCTTCTGT AGTTTCAATA CGTTCACTAA TAATAAGTGA 2520 

TGTAGCTAAG TCACGTAACA TAGCTTTACG TTGATCAGAA GTACGACCTA ATTTTCTGTA 258 0 

ACCCATGAGT TAACCTCCTT TATCAATCTT CTTTTCTTAA TCCTAATCCT AAATCTTCTA 2 64 0 

ATTTGTATTT AACTTCTTCT AAAG AT T T AC GACCTAAATT ACGCACTTTC AT CAT G T C AG 2700 

CTTCAGATTT GTCAGCTAAC TCTTGAACAG AATTGATTCC TGCGCGTTTT AAGCAGTTAT 2 7 60 

ATGAACGTAC AGATAAGTCT AATTCTTCAA TAGACATTTC TAATACTTTT TCTTTTTGAT 2820 

CTTCTTCTTT TTCAATCATG ATTTCAGCGT TTTGCGCTTC AT C AG T AAG A CCAACGAAGA 28 80 

TATTCAAGTG TTCAGTCATT ATTTTTGCTG CTAATGAAAC TGATTCTTGT GGTGTGATTG 2 94 0 

AACCATTAGT CCAAACATCC AATGTTAATT TATCAAAATC ACEGCTTTGA CCTACACGTG 3000 

TATTTTCAAC AGTATAGTTC ACACGTTCAA CAGGTGAATA CAATGAATCA ACAGGGAT T A 30 60 

CACCAATTGG TAAATCACTA GTATTATTTT GTTCTGCTAA TGCGTAACCT CTACCCTTGT 3120 

TAGCAACTAG ACGAATTTTT AAG T G AC C AC C T T TAG AT AC TGTTGCAATT TTAAGCTCTG 3180 

GGTTTAAAAT TTCAACATCA CTATCATGTG TAATGTCGCT TGCTGTTACT TCGCCTTCAT 32 4 0 

CACGTACATC AATTTCTAAA GTTTTATCTT CTTCAGAGTA AATTTTCAAT GCTAATTGTT 3300 

TAATGTTCAT AATAATTGTA GAAACATCTT CAACTACATT GTCTACTGCT GAGAATTCAT 3360 

GTAAAACTCC CTCAATTTCA ATATACTTAA cGGCTGCACC TGGTAATGAA GATAGTAGGA 3420 

TACGACGTAA GGAGTTTCCT AGTGTAGTAC CGTAGCCACG TTCTAGTGGT TCAACAACGA 34 8 0 

ACTTACCGAA TTTAGCATCT TCACTAATTT CAATTGTCTC AATTCTAGGT TTTTCGATTT 35 4 0 

CTATCATTTA AATATCCTCC TTATATACGT CGACTTAATT TAAAATGTTTGCTCAGTGAC 3600 

CTGTAACAAT ACCATCATAA ATTATACACG ACGACGTTTT GGTGGACGAC AACCGTTATG 3660 

AGGTACTGGA GTAACGTCTC TGATCGCAGT TACTTCTAAA CCTGCAGATT GTAATGCACG 37 20 

AATAGCTGAT TCACGACCTG GACCAGGTCC TTTAACTGTT ACTTCAACTG TTTTTAAACC 37 80 

ATGCTCCATA GCTGATTTAG ATGCAGTTTC AGAAGCCATT TGTGCTGCAA ATGGTGTTGA 38 4 0 

TTTTTTAGAT CCTTTGAATC CTAATGCACC AGCTGATGAC CATGATAAAG CATTACCGAA 3900 

CTCATCAGTG ATAGTTACAA TAGTGTTGTT GAATGTTGAA CGGATGTGTG CTACACCATT 3960 

TTCAATATTC TTTTTCACTC TACGTTTACG AiATACTTGT TTACGTGCCA TTTAAAATTT 4 02 0 

GCCTCCTTTA CCTATkATTT TTTCTTGTTA GCTACCGTTT TAACTGGTCC TTTACGAgTA 4080 
CGCGCGTkGT TTTTCGTTTT TTGACCACGA ACTGGTAAAC CACGACGGTG ACGGATACCc 4140

ACGGTATGAT GAAATTnCCA TTAAAcGTTT GATATTTAAG TTAGTTTCAC GACGTA&TC 4 2 00 

ACCTTCGACT TTATAACCGT CTACAACTTC ACGGATGCGA CCTAATTCGT CATCAGTTAA 4 2 60 

ATCTTTCACA C GAG TAT C AG CTGATACGTT AGCTTCTTCA AGAATTTTTT GAGCAGTTGA 4 32 0 

CGTACCGATA CCGTATATAT AAGTTAATGA GATAACTACG CGTTTTTCAC GTGGAATATC 4 38 0 

TACTCCTGCA ATACGTGCCA TATTAATTTA CACCTCTCTT TTATTAACCT TGTCTTTGTT 4440 

TGTGTTTTGG ATTTTCACAA ATTACCATTA CTTTACCTTT ACGTTTAATG ACTTTACATT 4 500 

TTTCGCAAAT AGGTTTTACT GATGGTCTTA CTTTCATTTT TATACCTCCC TATATTATGG 4 5 60 

AGTGACGATT ATTTATAACG ATAAGTAATT CTTCCGCGE TTAAATCGTA CGGAGACATC 4 620 

TCAACAGTTA CTTTGTCGCC AGGTAGAATA CGAATGTAAT TCATTCTGAT TTTACCACTT 4 680 

ACGTGAGCnA AAATCTCATG ACCATTTTCT AATTCTACTT TAAACATTGC GTTCGGTAAA 4740 

GTATCTAATA CAGTACCTTC TAATTCAATT ACATCTTGTT TAGCCATTGA TTAACTTCCC 4 8 00 

CCTTTTTGCA ATAGTAAGGT AATCGTCAAT AGACAACTTT ATTGTTACGA ATCTATCAGT 4 8 60 

GATTAATTTT ATAAGTTAAA CAAAAAT T AC GGGAATTAAT TATCGTTAAT TGCCACTCTC 4 920 

ATCTATCTAA T ATGAT T AAA TCATGCCTCA CTTAAAATAG ACCGCTAAAA GTTGATCTAT 4 980 

TACAAATGAT CTAAAATATC AATGACATCT TTGGTAACGT CGCTAATATC TTTTGAACCA 504 0 

TCAATATTTT TCAATACACC TTTTTGATCA TAGAAATCTA AAATAGGCTT AGATTGTTTA 5100 

AT AT T AAC AC TCAAACGATT AGCTACCGTT TCAGGATTAT CATCTTCTCG TTGATACAAT 5160 

TTACCACCAT CGATATCACA AATACCTTCG ACTTCGGAGG ATTAAKACA AGATGATACG 5220 

TTGTACCACA TGACTCACAG ATTCGACGAC CTGTAAGACG GTTCATTAAT TCTTCTTCCG 52 8 0 

GAACTTCGAT ATTGATGACA GCATCAATGT TTCTGTCAAG CTCAGACATA ATATTATTTA 534 0 

ATGCCTCAGC TTGCTCGATT GTTCTTGGGA AGCCATCTAA TAAAAAGCCT TTTTTTGCAT 5400 

CGTCTTCAGA AATTCTTTCC T T AAC GAT AC C T AC AG T C AC T T CAT C AGG A ACTAATTCGC 54 60 

CACGGTCCAT ATAAGACTTA GCTTCTTTAC CTAATTCAGT TTCTTCTTTT ATAGCTTTTC 5520 

TGAACATGTC ACCAGTTGAA ATGTGGGGTA TTGGGAATTT CTTGaCAATT TCACTTGCTT 5580 

GAGTTCCTTT ACCTGCGCCA GGTAAACCCA TCAAAATGAT ATTCATAAGT GCCCTCCTAA 5 64 0 

AATTATCTAC CACCAAAGCC TTTATATTCT T T T T GAG AT A CTTGAGCTTC TAAAGATTTC 57 00 

ATTGTTTCAA TCGCTACACC AAT AAC GAT A AGTAAACTTG TACCACCAAT CTGAATTGAT 57 60 

TGTGGTAATC CCATAAACTT AGTTGCTAAT ATCGGTAGAA TTGAAATAAC GGCAAGAAG 58 2 0 

ATAGAACCTA CAAAAGTTAA ACGATATAAA AC T T TAG T AA TATATTTTTT GGTTTGTTCA 5 8 80 

CCAGGTCTAA TACCTGGAAC ATAGCTACCT TGTTTCTTAA GGTTATCAGC CATTTTTTCC 5940 
GGATTAACTT GTACAAATGC ATAGAAGTAT GTGAATAGTA TAATTAGTAC AATATATACA 6000

ACCATACCAA CAT T AC T T GA AGGATTTGCA GCATTCGCAA TGTTTTGTGC CCATTCTTTA 6060 

TCTGGATAGA ACAACGTTAA TGTTCTAGGC AGTAAGAAGA ACGCCATTGC AAAG AT T AC A 6120 

GGAATAACAC CGGCTGAGTT CACTTTTAAA GGTAGATAAG TTGCCTGTGA ACCTAATCTT 618 0 

TGAGCAGTTT GTTTCTTAGC ATATTGAATC GGAATTTAC GAACGGCTTC AAGTACATAA 62 4 0 

ATAGCACCTA CAGTTAA.TAG TATCAGTGAC AC T AAAAGTC CTAATACTTT CAACCATGCT 6300 

AAT GATGT AT CTTCTTGCCC AACGAACGCA tTTGTcCAAA TTGAATTAGA CTGGCTGGCA 6360 

ACGTTGATAA AATACCCGCA AATATGATAA TAGAAATACC ATTACCAACA CCGAACTGAG 64 20 

TGATTTGATC ACCAAGCCAT AT T AAG AAAG CAGTTCCTGC TGTnCAAAAC TAGTGCTATT 64 8 0 

AATAAATAAC TCATAATTGA CTGATTGATA ATCAGCGCAC CTTTGAGATA ATTATTAAAT 65 4 0 

TGGAATGCCA TACCTATAGA TTGGATAAAT GC T AAAG AAA TTGCTAAATA ACGAGTAACG 6 600 

TTATTTAACT TTCTTCTACC TACTTCACCT TGTTTTGCCC ATTCTGAGAA TTTAGGGACA 6 6 60 

ATATCCATTT GTAATAATTG CATTACGATT GATGCAGTGA TGTAGGGTAC AATACCCATT 6720 

GCAAAAATAG AAAATCGTTT cAAGGCTCCG CCACCAAAAG TATTTAATAA CTCAGTGGCA 6780 

CCTTGAGAAC CTTGGGGATT ATCAAAAGCT GCAGGATTTA CECTGGAGC TGGTATATAA 68 4 0 

GTCCCTATTT TAAAAATTAC TAACATTGCT AGTGTGAAGA AAATCTTGTT ACGAACCTCT 6 900 

TTTGTTCTAA AGAAGTTCAC AAGGGTTTGA AT CAT T AGAT CACCTCGTGT GcTCCACCTT 6 9 60 

TAGCATCAAT AGCTTCTGCT GCTGAAGCTG AG AAT T TAT G AGCTTTCACT GTCAATTTCT 7 02 0 

TATCAAGTGA ACCATTACCT AGTATTTTGA TACCAGATTT TTCATTCTTA AC AAC AC C AG 7 080 

ATTCTACTAA TAAAGCTGGA GTTACTTCAG TACCATCTTC AAATTTATTA AGTTGGTCTA 714 0 

AGTTAACAAT AGCATATTCT TTACGATTTA TGTTAGTAAA ACCACGTTTT GGTAAACGAC 7200 

GGAATAATGG TAATTGACCA CCTTCAAATC CTGGTCTTAC ACCACCGCCT GAACGAGCTT 7 2 60 

TTTGACCTTT GTGTCCGCGA CCACTTGTTT TACCGTTACC TGTCGCAACA CCACGTCCAA 7 320 

CACGATTGCG TTCTTTACGT GAACCTTCTG CCGGTTTTAA CTCATGTAAT TTCATTTCGG 7 380 

CACCTCCTTG ATTATTTTTC TTCTACTGTT ACTAAGTGCT TAACTTTGTTGATTTGCCCA 74 4 0 

CGAATAGCAG GGTTATCTTC AACAACTACT GAACTGTTAG TCTTTTTAAG ACCTAAAGCT 7 500 

TCAACAGTTT TACGTTGTGT TTCAGGACGA C C AAT AAC AC TACGAGTGAG GGTAATTTGT 7 5 60 

AATTTAGCCA T AAC TAG T T T TCCCTCCTTA ATTGTATAAT TCTTCTACTG TTTTGCCACG 7 620 

TAATTTCGCA ACATCTTCAG CAT T T T T AAG GTTTTGTAAA CCATCGATTG TAGCACGAAC 7 68 0 

CATGTTGATT GGTGTGTTTG ATCCTAATGA TTTACTTAAG ATATCAGTGA TACCTGCTAA 77 4 0 
TTCAAGTACG GCACGAACAG GACCACCAGC GATAACTCCT GTACCAGGTG CAGCCGGTTT 7800

CATAAATACG CTTCCTGAAC CGTAACGGCC iSTAATTGTG TGTGGAGTTG TACCTTCAAC 7 8 60 

ACGTGGAACA ACTACTAAAT CTTTTTTAGC TGCTTCAACA GCTTTTTTGA TTGCTTCTGG 7 92 0 

TACCTCTTGA GCTTTACCAG TACCGAAACC TACACGACCA TTTTTGTCTC CAACTACAAC 7 980 

TAATGCAGTG AAACGGAAAC GACGACCACC TTTTACAACT TTTGCTACAC GGTTGATGT 80 4 0 

AACAACGCGT TCTTCAAATT CTTTCGTCTC TTCTTCtCTA CGAGCCATGT ATTTGTCCCT 8100 

CCTTTAAATT AAAATTCTAA TCCGCTTTCT CTTGCTGCTT CAGCTAATGC TTTAAcACGT 8160 

CCGTGATATA AATATCCTCC ACGGTCAAAT ACGATTTCTT TAATGCCTTT GTCAGCAGCT 8 220 

TTTTTAGCAA TTGCTTCACC GACTTTAGTT GCTAATTCAA CTTTAGTTGC TGTAGTAGCA 8280 

ATGTCGCTGT CTTTTGAAGA AGCTTGAGCT AATGTTACGC CTTTATTATC ATCAATAATT 8 34 0 

TGAGCGTAGA TATGCTTGTT TGAACGATAT ACGTTTAAAC GTGGCTTTTC AGCTGTACCT 8 4 00 

GATAAGTTAG TACGAACACG AGCATGTCTT TTTAAACG& CTTTATTTTT ATCAATTTTA 84 60 

CTGATCATTT CAATACTCCT TTCTTTAGAG TTTATCTATT ATTTACCAGT TTTACCTTCT 8 520 

TTACGGCGAA CGTATTCACC TTGGTAACGA AT AC C T T T AC CTTTGTAAGG CTCTGGAGGT 8580 

CTTACTGAAC GGATGTTAGA TGCTAATGCT CCAACTTGTT CTTTTGAAAT ACCTTCAACT 8 64 0 

TTAACGACTG TGTTTTTCTC AAC T GAG AAA GTAATGTTTT CTTCAGCTTT AATTTCTACT 8 7 00 

GGGTGAGAAT AACCAACGTT AAGGAT TAAG TCTTTACCTT GCATTTGAGC ACGGTAACCT 8 7 60 

ACACCAACAA GTTCAAGTAC TTTTACGTAT CCTTGAGAAA CACCTTGTAC CATATTGTTT 8820 

AATAAAGCAC GAGTTGTACC ATGGTTTGTT CTATCTTCTT TAGAATCAGA TGGTCTTACA 8880 

ACTTCAATTG TGTTTTCTTC TTGTTTGAAT GTCATTCTTT CATTTAAAGT TCTTGATAAT 8 94 0 

TCACCTTTAG G AC C T T T AAC AGTTACATGA TTTCCATCAA AAGTTACTGT TACGTCACTA 9 000 

GGGATGTCAA TAATTTTCTT AC C AAC AC G A CTCATGTTAT GGCACCCCT TATTTTTTAT 90 60 

TACCAAACGT ATGCGATAAT TTCTCCACCA ACATTACGTT TTCTTGCTTC TTTGTCAGTG 912 0 

ATTACACCTT CAGAAGTTGA TACTAATGCA ATACCTAAAC CATTTAATAC TTTAGGCATT 9180 

TCGCTAGCTT TTGCATAAAC ACGTAAACCT GGTTTTGAAA TACGTTTTAA TCCTGTGATA 924 0 

ACAC GC T CAT CGTTTTGACC ATATTTTAAG AATAAACGAA GTACACCTTG TTTATCATCT 9300 

TCTACGTATT CAACATTTTT AATGAAACCT TCACTCTTTA AGATTTCAGC AATTTCTTTT 9 3 60 

TTAATATTTG ATGCAGGTAA TTCTAACTTC TCGTGACGCA CCATGTTTGC GTTTCTTACA 94 20 

CGAGTAAGCA TATCTGCGAT TGGATCTGX ATTGTCATTG ATTGTTGCCT CCTTTCAGAC 94 8 0 

TCTTTTTTAT TACCAGCTAG CTTTACGAAC GCCAGGGATT TGGCCTTTGT AAGCTAATTC 954 0 

ACGGAAACAA ATACGGCATA ATTTAAATTT ACGATATACA GAATGTGGAC G G C C AC AAC G 9600 
TTCACAACGA GTGTATTCAC GAACTGCATA TTTTTGTTTT TTTTGTTGCT TAGAACCAT 9660

TGAAGTTTTA GCCACTTAAT TAGCCTCCTT TAAATAATTA TTTACGGAAT GGCATACCGA 97 20 

AGTTAGCTAA CAATTCACGA GCTTCTTCAT CAGTGTTAGC AGTCGTTACG ATAACAATAT 97 8 0 

CCATTCCTCT AACTTTACTT ACTTTATCAT AGTCGATTTC TGGGAAAATT AATTGTTCTT 98 4 0 

TAACACCTAA AGTGTAGTTA CCGCGTCCGT CAAATGCTTT TTTAGAAACA CCTTGGAAGT 9900 

CACGTACACG TGGTaATGAT ACTGAAATTA ATTTGTCTAA GAATTCATAC ATTCTTTCAC 9960 

CGCGAAGTGT TACTTTCGCA CCGATTGGCA TACCTTCACG TAAACGGAAA GTCGCGATTG 10020 

aTTTTTTAGC TTTAGTTACT AATGGtTTTT GACCiSTGAT CAATTCTAAT TCTTCAACAG 10080 

CATTGTCTAA TACTTTAGAA TTTTGTACTG CGTCACCTAC ACCCATGTTC ACAACGATTT 1014 0 

TATCTATTTT TGGTACTTCC ATTACTGAAC TATAATTGAA TTTTTTCATT AAGTTTTCAG 10200 

TAACTTCAGT GTTaAACTTT TCtTTTaAAC GGTTCaAAGT GGGATCCTCC TTTCaACTTG 102 60 

TtATTAATTA TTAGAkTTAA TTTCTTCGCC AGATTTTTTA GCGATACGAA CTTTTTTACC 10320 

ATCAACAAAT TTGTAACCTA CACGAGTTGG TTCGTTTGTT TTAGGGTCCA ATAATTGTAC 10380 

ATTAGAAACA TGGATTGCTG CCTCTGTTTC TAAGATTCCA CCTTCAGGAT TTAATTGAGT 10440 

TGGTTTTTGG TGTTTTTTCA TAATGTTAAC ACCTTCCACA ACGACACGGT CTTTTTTAGG 10500 

TAGAGTAGCA ATT ACT TT AC CTTCTTTACC TTTGTCTTTA CCTGCGATAA CTTTAACGTT 105 60 

GTCACCTTTT TTGATATGCA TGTGGGCACC TCCTTATTTG TATTGGTTGT TATTAATTAA 10620 

AGTACTTCTG GTGCTAATGA TACGATTTTC ATGAAGTTAC CTCACGTAA TTCACGAGCA 10 680 

ACAGGTCCGA AGATACGAGT ACCACGTGGG CCTTTGTCAT CACGGAT GAT AACACATGCA 10740 

TTTTCATCAA ATTTGATGTA TGAACCGTCA T T AC G AC G AA CACCTGACTT AGTACGTACG 10800 

ATTACAGCTT TGACAACGTC ACCTTTTTTA ACAACGCCAC CTGGTGTTGC ATTTTTAACA 108 60 

GTACATACGA TAACATCGCC GATGTTTGCT GTTTTACGAC CAGATCCACC TAATACTTTG 10 920 

ATTGTAAGAA CTTCACGAGC ACCAGAGTTG TCTGCTACTT TCAAGCGTGT TTCTTGTTGG 10980 

ATCATTAGTT AAACCTCCCT TATCTCTAAA CTTGTATTAA ATAATTACTG ACTCTTCAAC 110 4 0 

AATCTCTACT AAACGAAAAC GTTTTGTTGC TGATAAAGGA CGAGTTTCTT GAATTTTAAC 11100 

AATGTCTCCT AATTTAGCTG AATTGTTTTC ATCATGAGTT TTGTATTTTT TAGAGTATTT 11160 

TACTCGTTTA CCGTATAATT TGTGTGTTTT GTAAGTTTCA ACAAGTACTG TAATAGTCTT 11220 

GTCCATTTTG TCTGAAACAA CTTTACCTAC ATAAACTTTA CGATCGTTTCTTTCGCTCAC 11280 

TTTTGTAACC TCCTCTTACT TAATTATTGA TTAGCCTTAC TTTGTTCAAT TTCTCTTTCA 1134 0 

CGAGCAACAG TTTTTAGACG TGCAATCGTT TTTCTTACTG TACGAATACG TGCAGTTTCT 114 00 
TCTAATTGAC CTGTAGCTAA CTGAAAGCGT AGGTTAAAAA GCTCTTCTTT TGAAGATTTG 11460

ATTTGTTCTT CGATTTCTGA AGTGGTTAAG TCTCTAATTT CCTTAGCTTT CATTTGTTTC 11520 

ACCACCCAAT TCCTCACGTT TTACAAACTT AGTTTTTACT GGAAGTTTGT GACTTGCTAA 1158 0 

ACGTAGTGcT TCACGCGCAA CTTCTTCAGA AACGCCAGCA ACTTCGAATA AAATTCTACC 11640 

TGGTTTAACA ACTGCGATCC AGCCTTCAAC (ECACCTTTA CCAGCACCCA TACGTACTTC 11700 

TAAAGGTTTT TTAGTATATG GTGTATGTGG GAAGATTTTA ATCCAAACTT TCCCGCCACG 117 60 

TTTCATGTAA CGTGTCATTG CTATACGAGC AGATTCGATT TGACGAGATG TGATCCAAGA 1182 0 

CGTTGTTGTA GCTTGTAAAC CAAACTCACC AAATGTTACG T At T ACCGCC TTTAGA&GA 118 80 

CCAGTTGTTT TAGGACGATG TTGACGACGA TATTTTACAC GTTTTGGTAG TAACATTATT 11940 

ATTTTCCTCC TCCACTAGTG TTCTTAGTAG GAAGAACTTC TCCACGATAA ATCCATACTT 12000 

TAACGCCTAA TTTACCGTAA GTAGTGTCAG CTTCAGCGTG tGCATAATCG ATGTCAGCAC 12 0 60 

GTAACGTATG AAGTGGAACA GTTCCTTCTG AATATTGTTC AGCACGAGCG ATGTCAGCTC 12120 

CGCCTAAACG ACCAGATACT TGaGTTTTGA TACCTTTAGC ACCAAGTTTC ATAGCTCTAG 12180 

TGATTGCTTG TTTTTGTACA CGACGGAATG AAGCACGGTT TTCTAATTGA CGTGCGATGT 1224 0 

TTTCAGCTAC TAAACGAGCG TCAAGATCAA CTTTTTTGff TTCAATTACG TTGATGTGTA 12300 

CTTTTTTATC AGTTAACGCA TTTAATTTGT TGCGTAATTT TTCGATTTCT GAACCGCCTT 12 3 60 

TACCAATTAC CATACCAGGT TTACCAGTAT GAATTGCAAT GTTGATACGG TTTGCAGCAC 12420 

GTTCAATCTC TACGTGAGAA ACTGATGCTT CTTTTAATTC ATTATCAATA AATTTACGGA 12480

TTTTTAAATC TTCGTGTAAA AGTGAAGCGA AGTCTTTTTC AGCATACCAT TTAGCTTCCC 12540 

AATCACGGAT AATACCAACA CGAAGTCCGA TTGGATTAAT TTTTTGACCC ACAGTATTCC 12 600 

CTCCTTAAAA GTTAATTAAG CTTCTTTAGC TTCTTCTTTA CCGTCACTTA CGACGATTGT 12 660 

AATGTGGCTT GTACGTTTGT TAATCGCACT TGCACGACCT TGCGCACGTG GACGGAAACG 12720 

TTTTAATGTT GGTCCTTCGT TAGCATATGC TTCTTTAACT ACTAATTCAT CTGTGTTCAT 12780 

GTCATAGTTA TGTTCAGCAT TAGCTAAAGC GGACATTAAT ACTTTTTCAA TTACTGGTGA 12 84 0 

TGAAGCTTTG TTTGTTAATT TTAAAATTGC AATAGCTTCA GCAGCffiTTT TACCTCTGAT 12 900 

TAAGTCAAGA ACTAGTCTTA CTTTACGAGG TGCGATTCTT ATTGTTCTAG CAACCGCTTT 12 9 60 

TGCTTCCATT AGGATGTCCT CCTCTACTTA ATAGATATTA TCTTCTTGTT TTCTTGTCGT 13020 

CTGCAACGTG TCCTTTGAAT GTACGAGTAG GAGCAAACTC ACCTAATTTA TGACCTACCA 13 080 

TATCTTCAGT TACATATACA GGTACGTGTT TACGTCCGTC GTATACTGCA AAAGTATGTC 1314 0 

CGATGAAATT AGGGAAAATT GTAGAACGAC GTGACCATGT TTTGATTACT TGTTTCTTTT 13200 

CGCTTCCTTC TTGAGCTTCA ACTTTTTTCA TTAAATGCTC ATCGACGAAA GGTCCTTTTT 13260 
TAATACTACG AGCCATTTGG GCGCCTCCCT TCTTATTATG TGCGTGCAGC TTTAAGCCGC 13320

AC AC C C AAAT AAGTTGATTA TATTATTTTT TCTTACGTCC ACGAACGATA AGTTTGTCTG 1338 0 

ATGATTTTTT ACCACGACGA GTTTTCTTAC CAAGCGTAGG TTTACCCCAT GGTGACATTG 13440 

GAGATGGTCT ACCGATAGGA GCACGACCTT CACCACCACC GTGTGGGTGA TCETAGGGT 13500 

T CAT T AC AG A ACCACGAACT GTTGGACGGA TACCTTTCCA TCTTGAACGT CCGGCTTTAC 13560 

CAACGTTAAC TAATTCGTGT TGTAGGTTAC CAACTTGACC GATTGTAGCA CGGCAAG TAG 13 620 

ATAAGATCAT ACGAACTTCA CCAGATCTTA ATCTGATTAA TACGTATTTA CCTTCTTTAC 13680 

CAAGTACTTG AGCACTTGCA CCAGCTGAAC GAGCGATTTG TCCACCTTTA CCAGGTTTAA 13740 

GCTCGATGTT GTGTACTACT GTACCAACTG GAATGTTTTG TAATGGTAAT GCGTTACCAA 13800 

CTTTGATGTC AGCTTCAGCA CCACTTTCAA CGATTTGACC TACTTCTAAT CCTTTAGGAG 13860 

CAATGATATA TCGTTTTTCA CCGTCTGCAT ATACACTAA AGCGATGTTT GCTGAGCGGT 13 920 

TTGGATCATA TTGAATAGAA TCAACTTTTG CATTGATACC ATCTTTGTTA CGTTTGAAAT 13980 

C G AT AAC AC G GTATTGACGT TTGTGTCCAC CACCATGGTG TCTTACAGTC AATTTACCTT 14 040 

GGTTGTTACG TCCCGCTTTT TTCGGTAGCG GTTTTAATAA TGACTTTTCA GGTGTAGTTT 14100 

TCGTGATTTC TGCGAAATCT AACGAaGTCA TATTACGACG ACCATTTGTT ATTGGCTTAT 14160 

ACTTTTTAAT AGCCATTGTC GCTTACCTCC TTAATGGTAA TTGTTTTATT AGTTAAATAA 14 220 

GTCGATTGAT CCTTCTTTAA GAGTTACAAT CGCTTTTCTT CTTTTGTTTG TATAGCCTTG 14280 

GTAACGGCCC ATACGTTTTT TCTTAGGTTT GTAATTCATG ATATTAACAC TTGCAACTTT 14 34 0 

TACGTTGAAG ATTTCTTCAA CTGCCATTTT TACTTGTGTT T T G T T AAC AC GAGTATCAAC 14400 

GTCGAAAGTG TATTTGTCTT CAGCCATTGC TTCAGAAGAT TTCTCAGTGA TTACGGGGCG 144 60 

CTTAAGAATA TCTCTTGCTT CCATTATCCG AGCACCTCCT CACTTTTTT AGCAGCAGCT 14520 

TCAGTAATTA CTAAGCTGTC AGCATTAGTG ATATCTAAAA CATTTAAACC TTGAGCAGTT 14 580 

GTCACTTGAA CGCCAGGGAT GTTGCGTGCT GATAATTCAA CATTTACATC TTCGTTTTCA 14 640 

GTAACTACTA ATACTTTTTT AGGTTGTTCT AAT G TAG AT A ATACATTTTT GAATTCTTTA 14700 

GTTTTTGGAG CTTCGAAGTT GAATGCGTCA ACTACAGTTA AGCCATTCTC TTGAGCTTTG 14 7 60 

AAAGATAATG CTGAGCGTAA AGCTAAACGA CGCATTTTCT TAGGCATTTT GTATGCATAA 14 820 

CTTCTTGGAG TTGGTCCGAA TACGATACCA CCGCCACGCC ATTGTGGAGC ACGGATTGTA 14 880 

CCTTGACGAG CACGACCTGT TCCTTTTTGC TTCCATGGTT TACGTCCGCC ACCGCTTACT 14 94 0 

GCTGAACGAT TCTTAACAGC ATGCGTACCT TGACGTAATG AAGCACGTTG T AAAT T AAT A 15000 

GCTTCGAATA AAACGCTATT ATTTGGCTCA ATACCGAATA CTGCATCGCT TAATTCGATT 15060 
GAACCTGATT TAGTTCCGTC TAATTTTAAA ACATCATAAT TAGCCATTAT GCATTTCCTC 15120

CTTTCACTTC TTATTATTTA TTACCTTTTT TAATTGAAGT TCTGATTTCT ACTAAACCTT 15180 

TTTTAGGTCC AGGTACGTTA CCTTTTACTA AGATAACTTT GTTTTCTGTG TCAACTTGAA 15240 

CTACTTCTAA GTTTTGAACA GTTACAGTGT TTCCACCCAT ACGTCCTGGC ATTTTTTGGC 15300 

CTTTAAATAC TCTAGAAGCA TCTGAAGCCA TACCTACAGA ACCTGGTGCT CTGTGGAAAT 15360 

GAGAACCGTG TGACATAGGT CC AC GAG ATT GTCCGTGGCG TTTAATTGCA CCTTGGAAAC 154 20 

CTTTACCTTT TGATACGCCT GTTACGTCAA TAACGTCGCC AGCTACAAAA GTATCTACTG 15480 

AGACTTCTTG AaCCTAcTcG TAAGCATCCA (ETCTACATT GCGGAATTCA CGAATGAAGC 15540 

GCTTAGGTGC TGCGTCAGCT TTTTTAGCGT GACCTTCAGC TGGTTTATTA GCATATTTAT 15600 

TAGATTTTGC ATCTTTTTTG TATGCTTTTT TGTCTTCAAA TCCAACTTGG ATTGCGTTGT 15660 

ATCC AT CAAC TTCTACAGTT TTCTTTTGTA ATACAACATT TTCTTTAGCT TCTACTETG 1572 0 

TTACAGGGAT TAATTCACCG TTTTCTCCGA ATACTTGTGT CATCCCAATT TTTCTTCCTA 15780 

AGATTCCTTT GGTCATCGAA AGTCCACCTC CTAAAATTGT CTATTATAAT TTGATTTCGA 1584 0 

TGTCTACACC AGATGGTAAG TTTAAGCCCA TTAAAGCGTC AACTGTTTTT GGTGTTGGGT 15 900 

TTACAATATC GATTAAACGT TTGTGTGTAC GTTGTTCGAA TTGTTCACGT GAATCTTTAT 15960 

ACTTATGCAC GGCACGGATG ATTGTGTAAA CTGATTTCTC AGTTGGTAAC GGAATTGGTC 16020 

CAGAAACATC TGCACCAGAA CGTTTCGCTG TTTCTACAAT CTTCTCTGCT GATTGATCAA 16080 

TTACGCGGTG ATCATAAGCT TTTAATCTGA TTCTGATTT TTGTTTTGCC ATAATTTTCC 1614 0 

CTCCTTATTC GTCTACATTT AG T GAT AG AC TTCTCCACGA AAACTATCTT ACACAGCGCC 16200 

ATGGCAAAGC GGCCGGGTGT GTCAGTAACC TTTCGCTTCA TCGCTTTTCT TAAAGTCCAA 162 60 

CGTTAGTTAT ATTACACGAA AAACATCGAT AAATCAAGGC TTTTCACATA ATTTTTCTAT 16320

CTGTCTAACA CATACTTTTA TATTTnACTT TATATACTTA GTCAGTTCAA CTATTTTCGA 1638 0 

GATATTTTnA ATTTCCn 16397 
(2) INFORMATION FOR SEQ ID NO: 206:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

TnAGTTGTTT CTGCCACGAA AGATTCAATG GCTTTTCTTG CTTTACGCTT TTCTTTCAAT 60 

GGCAAATCAC CAATCATTTT TTTAAGGTGA TGTGGGTTTA CAACACCACT ATACTGGTAG 120 
TCATTTGAAn TTGTTTTTAG GGCTTGTTCA TCGATAGATC TCTCTCCAGC AAATCCTTTG 180

AACTCCGCTT CTTTTTTAAT ACTTTCGAAA TTAACATATT CTTGATCGAT ATCATCATCC 240

TTATTTAAAG AAGGTACAAC ATTGTCGATG AATTCTCTAA TTAGATCTCG TTTTAACCTC 300

AATGtCGGAT CATCTGCATG ATCTAAAATG CGTCTAATTT GTTCTTGGTT ACGACGTTGT 360

TCCGCTTTGT CTTCAAGATC AATTTGTCTC AATATATTCA TAATATAATT CACATTAATC 420 

GTATCATTAC GCATCATTTC TATTTCGAAA TCAATATCAT TTAAAATGGA TACTTTATTT 480

TTCTCAGCCG TCGCTCTTTT TACTTGATCG TACACAGCTA AATATTTACT TTTATAGTCT 54 0 

TCATTCTCTT GTTCATCCAT TCCAATTTCA TCAATTGTAA ACTCAAACTC GTCAAATGCT 600 

TTTAAACGTA ATATTATTTT AGCTAATAAA CGATAAGCtT CAACAAAGCG CTTTAGCTCT 660 

TCTTCATCtT GaATGtCATC APCCATGTGT GGTGTCGGCA CAATCATTTT AAGCTCACGA 720 

TAAGCGTCCA TAAATTCTTT TTTATACTCT TCATAACTGC G CAT T AAAAT TGTATCCGTA 78 0 

TCATTTGTTT GT GAGAAT AC TCTCAGTGCA TCGTCTGTCT CTTTTTTCAA GTCACGATAG 84 0 

TTTACAATTT TACCAAATGG CTTTGATTCT TTTTCAACCC TATTTGTHG TGAATACGCT 900 

TGAATTAAAT CATGATACAT TAAATTCTTA TCAACATATA AAGTGTTCAG TACTTTACTA 960 

TCAAAACCAG TTAAGAACAT ATTAACAACG ATTAAGATAT CAATTTTACT ATCTTTAACG 1020 

CCCTTTTTAA CGTTTTTtGA AAT AT GAT T A AAATACTCAT TAGTTGtGGC TGnTGaAAAA 108 0 

TTCGTCTCGA ACTTTTTATT ATAATCACTA ATCATTATCT CTAATTTTTC ACGTGAATGA 114 0 

TATGGCACTT CACCATCACG ATCATCTTCA TTAGGTTTAA ACGTAAATAT ACCAGCTATC 12 0 0 

GTTAACGGTT GTTCCAACTT TTTGTTAAGT CGCTTAAATG TCTCATAATA TTTAATAAGC 12 60 

GCGTGAATAC TTTGGACTGT AAATATACTT GAATATTGAC GATTACGTGT ATATTTATCA 1320 

T GAT TAT T G A TGATATGTCG TGT TACT AAT TCCACACGTT TATCCGCTAA CCATACTTCT 1380 

TCCGTATCAA TTGCTTCAAC CATGCtGTTA TCTTCTGCTT TTAAAGCTTT ATTTTTAAAA 14 4 0 

G TAT T AAT AT AGTCAACTGA GAAACCAAGT ACATTACCAT CATGAATGGC ATCTTAATT 1500 

AAATACGTAT GTAAGCATCT ACCGAAAATA TCTGCAGTTG TTCTACCATC TTGACTACTA 15 60 

TTTTCTGGAA AACGTGGCGT ACCAGTGAAT CCAAAGTATT GGGCATTTTT GAAATGTTGT 162 0 

TTAACTAGAC GATGCATGTC ACCAAATTGA CTGCGATGAC ACTCATCAAT AATAAATACA 168 0 

ACTTTATTCG TTTTATACTG TTCTAATAAA GGGGCATTCC CTTGAATCGC TTTAGCCATT 17 4 0 

TTTTGAATCG T C G T T AC AAT AAGTGGCAAA CTTTTATCAT TTAGTTGGCG TACCAGTTGC 1800 

GAGGTATTAA AAGTTTTGTC TACAGCACCC TTAGCAAATT TATTAAATTC CTCTTCTGTT 18 6 0 

T G AC TAT C C A AGTCTTTACG GTCAACCAAA AAGATAATT TCTTAATGTC ATCTTGCTGT 1920 






GATAAAATCT GACTCGCTTT AAAAG AAG T C AACGTCTTAC CACTTCCAGT TGTATGCCAT 1980 
ACATATCCAT TATTCCCTGT CTCAGTCGCT TGTTGAATAA GTGCTTCTAC CGCATACACT 20 4 0 

TGATACGGAC GCATTGCCAT CAGTATTCTA TCTGTTTCAT TAATAATCAT ATAGCGCGAT 2100 
AT CAT C T TAG CTAATTGACA AGGTCTCATA AATGACTCAG CAAACGATTG CAATGTATTG 2160 
ATACGGTTAT TCTGTTTATC ACTCCAATAA AACATGTGAC TCTTCAATAG TTCGCTATCA 22 2 0 

TTATTAGAAA AGTATCGCGT TTCAACACCA TTACTAATGA TAAACATTTG TATGTAGCGG 22 80 

AATAAGCCTG TGTAATTTTG TTTGCGGTAA CGTTTTACTT GGTTAAACGC CTCATTAATA 234 0 
TCAATACCTC GACGTTTCAA TTCAACTTGG ACAAGGGGTA GTCCGTTGAT TAATATCGTT 24 00 

ACATCATAAC GTGCTTTATA TGTATCCTCG ACAGATACTT GATTCGTCAC TTGAAACTTA 24 60 

TTTTTACACC AACTTTTCGT ATCTAAAAAC GACAAATAAA TCTBGACTC ATCATCACGT 2 52 0 
CTAAGTGGTA ATTTATCACG TAAAATACGG GCACTCTCGA AAATACTTTT TCCATCAATC 25 80 

ATCGTTAACA GACGTTGaAA TTCTTTATCT GTTAaGGGAT TGCCTTCTAA TTTGTCCGCA 2 64 0 

TGACGCTCAT TTAAAATCGT TCTAAAATTA TCAAGCAATT GCTTATTATC ACGTATCGTT 27 00 

ACTCTTTCGT AACCCAATTG TTCAAGTTGA TTCATCATTT CATTTTCTAA TGCGTATTCA 27 60 

CTTTGGTATG CC AT TC AT AT CCCCTTCCAT ACACTTTCTA TTGCTCTAAA TATATCATAA 2820 
ACTTTAATGA AAAATGTTTG TTTTTTATCT TCAAACGTAA ATTTATTCTA ATTTTATTGT 28 8 0 

CTTATCTTTT AATATTTGTC TTTGACETAA GTCGTATACT AAAATTTGAA TACAAATAAT 2 94 0 
C AAAT CAT T G ATAAATTTTT TGTCTACGAT TAATGGAGGG ACTTGAATGG TGTTAATTAC 30 00 

CTATCAAATC ATTTTATTTT TTATTATTAG TCTAAGTTAC TATTTAACTT TAAATCATTA 30 60 

CATGGCAGTC ACTGTAGGTA ACTTCACTTC AATATTCGGC ATGTTCGCAG CATACTCTT 312 0 
TATGTACTAC TACCTACTCT ATAAAAGTCC CGAATACAAT CAACGCAAAC GATTTAAACA 318 0 

TTTCATTCAT ATCACTAATT TGATAATAAT TGCTTTTAGC ACCTTCGTAT TAGTTCATTT 32 4 0 

AGCATTAAAA TTATTCTTCA GCATTTAATT TCCATCTATG AAAAAAGCAA AGCTCAAATC 3300 

TGAACTTTGC TTTAATTTGT CACGCCTTTA TCATTTTCAA AATAGCCTCT ATGCCAGTTT 33 60 

TACAAACTTG TAGCAACAAT TTTTCATCAA GCAACTGAAT CACATCAAAA ACTTCAATTG 34 2 0 

TTTACGTTGC AACTAACTTT ATGTTAGAAC G C AC AC AAT A TCTAATGTTT AAAATCTATC 34 80 

GTCGCAAGAT GCTTCCTGTA AT TAT C AAG T GC(ATTTTCG ATTGGGTTAT ACAATCTAGA 354 0 

ATCGCATGAT AATTTAATGC TACAAATCGA TAGTACAATA TATCTACCGT GAATAACTGT 3600 

GCAAATAGTG ACGTTGTAGC CGCCATACGC ATTTCATTTT CATCAGTTCT GCCATAAATC 3660 

AATGCATAGT CTGCAATTTG AGCCACTGGA TTATTAGCTG TACTAGATAT AGTTATGAT 3720 

GGAATACTGT AATGTGTGGC CACCTGTGCA ATTGACTGCA AT T C AC TAT G ACTACCTTGA 37 80 
TTCGTCACAA AAATCATGCA ATCTCTATCA TCATGCGTCG CAAATGTTGA CACAAGTAAA 3840

TGCGTTTCAT GTAATAACCT GACATTTAAG CCAATACGAG ATAACTTTTG AAAAAGAT C A 3 900 

CCAATAGTCA AACTGGATGC GCCAAATCCA AATAAAAATA TTGTCCTGGc ATTTTTCAAC 3960 

ACATCACAAA TTGCATCAAT TTGCGCATCC ATAATATTAG TAGCTACAAA TCGCATCGTA 4 02 0 

TTCGTTGCTC TAGCAATCAT TTTATTTTTC AAAGTTTCTA CAGATTCATT TTCAATCAAT 4080 

TCTAAATGTG GATTGGTTGC AATATCTTCG GGTAAGTATC GAGATATCGC AATCTTTAGC 4140 

TCTTGAAAAC CTTGATGTGT CATTTTCCGA CTAAATCTAA CAATTGATGC TGTACTAACA 4200 

TTCGTAACAT CTGCCAAATC ATTCACAGTC ATATCAATGA TTTTATGTGG ATTCTTTAAA 4 2 60 

ATGTAATCAG CGATTATCTT TTCTGTCTTC GTAAAATCAC TCAACTGCTT ATCAATGCGA 4320

TATAAAATAT TTGTCATCAT TAATCACCCA ACAAATCTGT CTGTCGCATC GCCTTTGTCG 4 38 0 

TTCCAAATAA ATATGTACAA ACGAATCCAC CAGCATACGC AGCAAGTAAT CCTGCAATAT 4440 

AACCTAAATA CATATTATCT GAGATTAATG GTAATAGTGA CACACCACTT GGGCCTATTG 4 500 

CTTTGGCACC AATATGTCCA ATTCCACCTA TTACAGCGCC ACCAATACCA CCACCAATAC 4560 

AAGCAGTTAA GAAAGGTCGA CCTAATGGCA AAGTCACACC AT AG AT T AAT GGTTCTCCGA 4 620 

TACCTAGGAA ACCAACTGGC AATGCACCTT TTAAAGTATT ACGTAATGTT GTGTTGCGTT 4 68 0 

TACATCT TAC CCAAAGTGCT AATGCGGCAC CTACTTGTCC AGCACCAGC ATCGCTGCAA 4 74 0 

TTGGCAATAA GTAAGTAGCA CCTGATTGGT TAATCATTTC TATATGAATT GGCGTAAAAA 4 8 00 

TATGATGAAG CCCTAACATA ACTAACGGTA GGAAGCTTGC ACCAATGATA AATCCACTAA 4 8 60 

ATACGCCACC AAT AC T AAT A ATTCCGTTAA CTACTGAAAC TAAACTGTCT GAAACAAAAC 4 920 

CTGCTAATGG CATAAAGATA AAGATAGTTA ATAGTCCTAC AATCAACAAT GCAATAGTCG 4 980 

GCGTTACAAT AAT AT C AAT C GCATTTGGCA CAATTTTATG TAATCTCTTT T C G AC AAT AC 50 4 0 

TTAAAATCCA AACGGCAAAA ATAACGCCAA TAATCCCACC TTGTCCAGGT TGCAATGGTT 5100 

CTCCAGTGAA G AC AT T C AT T AAAATATTTT TACCAGCAAT ACCCGTTAAT AACGTTGTAC 5160 

CACCAATCAC GCCACCAAGT CCTGGTGTCG CACCAAATTC TTTAGCCGCA T T AAT AC C AG 5220 

TGAAAATAGC TAAGTATGCT AACATACCGT CTTTAATGAC ATTAAATACT GTTATAAGTT 528 0 

GCGTAATCCA AG C AC C T G AA ATATAGCCTG CCACCATTAA GTTACTCAGT ACTGEGCAA 534 0 

TACCACCAAT TAATCCAGCT CCAATAAATG CAGGAATCAA CGGTATAAAG ATATTGGCAA 54 00 

TTGATTTCAA TACTTTATTC AACTTACCAT TCTTTTGTTT TGCTTTATGC GCTTCCTTAT 54 60 

TCGCCTTTGC TTTATCAGCT GCATATGATT TATAGTCCAT TTTTTCACTA TCATTGTGAT 5520 

GGTGTGGTAT TCGGTCACCT AGTTTAACAC CACTTAATTC CGCCATATGA TTAGCCACTT 5 58 0 
TATTGatGTA CCAGGTCCAA CCACAACTTG AATGCGTTCA TCGTGTATAA CACCCATGAC 5640

ACCATCAATA TGCCTTAGTT CTTGGTCATC TACTTTATTC T CAT C T AAT A CTTTAATACG 5700 

CACACGTGTC AT ACAGT TCA TGACACTATC TATATTACC ATACCACCTA CTGCAGCAAT 57 60 

AATTCGTTCT GCAAGTTGTT GTTCTTTGGT CATTTAAATC CCTCCTAAGG TTGTCTATCT 5820 

CTGATTGCTC GTTTAAaATG TCACCATTGT TTAATAACCG TCTTGTTGCT TCTTCCTTAG 5880 

AAATGCCACA CATACCCATA ACTGTCGCAA CTTTCACATC ATGCTCAGAT ACCTGATATA 594 0 

ACGCCATTGC TTCATCATAT GTGATAGCAC ATATTTCTTG AATAATACGC ACTGAACGGT 6000 

CGATCAGTTT TTGATTGGTT GCTTTAACAT CAATCATGAG GTTATCGTAA ACTTTTCCGA 6 0 60 

CACCAACCAT TGTGATGGTT G AAAT CAT AT TTAAAATTAA CTTTTGTGCt GTACCAGACT 612 0 

TTAAACGTGT TGAACCAGTT AATACTTCTG GACCAACTTT AACTTCTACT GGATAcTGCG 6180 

CAATTTCACT TATAACTGCA TGTTCATTGC AT G AAAT AG A TACTGTTGTA GCACCGATTG 62 4 0 

TGTTAGCAAA TGTTAAACCG CC T AT AACAT ATGGCGTTTT GCCACTCGCG GCAATTCCTA 6300 

TAACGACATC TTTTGATGTT AAATCTATAT TTTTCAAATC TTCTCCGCT AATTTTTTGT 63 60 

GATCTTCCGC ACCTTCTACA GCCATCGTCA TAGCATGTTG TCCACCAGCA ATAATACCTA 6420 

TAATTTCATG AGGGTCAGTA TTGAATGTAG GTACACACTC CGCTGCATCT AAGAC AC CC A 64 8 0 

ACCTTCCACT TGTACCTGCA C C GAT AT AAA TCAATCGTCC ACCCTTTTTA TACTGTGCAA 654 0 

TTGTTTTTTT AATTACTTTT GTCAATTGTG GTATTGCCTT TCGAACTGCT AACGGGACTT 6600 

GCTGATCTTC TTTATTCATC GTAATTAAAG CCTCTTCCAC AGTCATTTCA TCAAGATGCA 6660 

TCGTCGCTTC ATTACGCGCT TCGGTCGTAC TATTTTCCAT CACTTCTTAC ACTCCCTAGT 6720 

TTTTTGAAAA TCAAATGTAT CATTCGSCTC GAT AC AAC T T AACAGTGGTA AGTCTTCTTT 67 8 0 

AATAATTTGT GCAaCAACAT TCACATTGTC ATGTGCACTA AGCGTTTGTC TCACAATTTG 6840 

CATTTCGCCT TGATAACGTC CGTTATTCAA ATTATCAACG GTTACTGAAC CAATGCGTCG 6900 

TTGCGTCGTA AACTGTGGTT GAATCGAATG TGGACATATT TGTCTTGACG TTCCGAACG 69 60 

AAT G AC AT T T TCCGGATTAT CCGGGCGTAC T T T AT G AC AC ATATCGAAAA GGTAAGTCAC 7 02 0 

TGTCGTATCA AACACTTCTT CAATACATAA CGTGAAATGC CTATGCTTGC AAAAATCTAT 7 08 0 

AAGTTGTTTT GCCTGCCTCA TTTCAATCAA TGAGTCTCCA AC T AAC AC T T C AG AT AC AC C 714 0 

AGTTTCTTGT AAT AAT T TAG CTGCAACGAC AGGATGACTA TGTCTCGTTG CTTCAATTGT 7 2 00 

TGGCAAGCCT TTATGCAAAG GACCTCGCAA ATCACTCCCT ACAATAAAAC CATATATTTG 7 2 60 

TGCCTTTGGA TTAAATTGAT AAATGAGTTC ATTTTTCTTA TTGACCAAGT C AAC AG A T AA 7 32 0 

TCCCGTATCT GGTCTTGGAT AATAGTTATG ACfflAATGAA AGTAATGTAA AAT CAT T C AA 7 38 0 

TTGTTGATGT AAGCTTGTTA ACAATTCCCG GGAAATAATA CTTGCATTCA AACAGCACTT 74 40 
TAAACCCTGT GCCATTATCG CTTCGATTGC CTCAATTGAT GTACTATGAT CGATACGAAT 7500

CATAAATTGT GCATCATATT GTCGAAGATG GTCATAAAAA GATGGTGTTA AAATAGATG 7 5 60 

ATTAGCATCT ATGAGGTAAG TCACTTGTTC ATGTTTTAAT AAATTGAGTA GTTTTGTGAA 7 620 

ATAATGATAT TTTGTCTCGT CATCTTCTTC TGGTATTTGT ACAGATGTAA AAATCATTTG 7 68 0 

GTAACCTTGT TTAATCATTC GCTTAATATA CGCTTCATCT AAAGGTTGTC C T AAAT AC AC 774 0 

TGAAAAGCCT GTCAAA.GTAG CCCTCCTTAA CAATATAATT ATTAGGAAAA TAT AG T T GAT 7 800 

TTGTGTAATC GCTTACATTT TACTATAAGA GAAAACACAT TACAATATTA AT C AG T T AAA 7 8 60 

GCCTGTTCAT TGTAATAATC TTACATATTT CTGTCACAAG TTAATTATTA CACCATCAAA 7 92 0 

GATTATCCTT TCTTTTAAGT GCTGATAATA GCTGCTACTGCTGGATTATT ACAATAACTT 7 980 

TTATACATTT TATTCAGGAT TATCTTATAT TATGTTTTAA TAATAATCTG TGAACAATTA 804 0 

AGAGATTTGA AATTGAATTT AATAATTGTA TTGAAAACGC ATACTTCACC ATGCTAAAAT 8100 

AGGAGTCGCA AACAAATAAG ATTCAATAAG ATGTGATGGT TACCAACACA GTCTATTTGC 8160

TCGTGTCTTT TTTTATTGAA TCTTAAATAA TAAATACAAC TTTGGAGGTT GGACAAGTGA 8220 

GGAAGAAACT TTTCGGTCAA TTGCAACGTA TTGGTAAAGC GCTAATGTTA CCTGTTGCGA 8280 

TTTTACCAGC AGCTGGTCTG TTATTAGCTA TCGGTACAGC TATGCAAGGT GAATCATTAC 8 34 0 

AACACTACTT GCCGTTTATA CA^AATGGTG GCGTACAAAC TGTCGCTAAA TTAATGACAG 8 4 00 

GTGCTGGTGG TATCATTTTT GATAACTTGC CTATGATTTT CGCATTAGGT GTCGCAATCG 8 4 60 

GATTAGCTGG CGGTGATGGC GTAgcAGCTA TCGCAGCATT CGTCGGTTAC ATAATCATGA 8 52 0 

ACAAAACAAT GGGCGACTTT TTACAAGTTA CACCTAAGAA TATTGGTBT CCAGCGAGTG 8580 

GTTACGCTAG CATTTTAGGT ATCCCAACAT TACAAACAGG TGTGTTCGGC GGTATTATAA 8 64 0 

TCGGGGCCCT GGCAGCTTGG TGTTATAACA AGTTCTATAA CATTAACTTA CCATCTTATT 870 0 

TAGGTTTCTT CGCTGGTAAG CGTTTCGTAC CTATTATGAT GGCTACAACA TCATTTATTT 87 60 

TAGCATTCCC AATGGCATTA ATTTGGCCAA CGATTCAATC AGGAT T AAAT GCATTCAGTA 8820 

CAGGATTATT AGAT TCAAAT ACTGGTGTTG CCGTATTCTT ATTTGGTTTC ATCAAGCGTT 888 0 

TATTAATTCC ATTCGGTCTA CATCACATTT TCCACGCACC GTTCTGGTTC GAGTTTGGTT 8 94 0 

CATGGAAAAA TGCAGCTGGT GAAATTATTC ACGGTGACCA ACGTATCTTT AT C G AAC AAA 9000 

TTCGTGAAGG CGCACATTTG ACAGCTGGTA AATTCATGCA AGGTGAATTC CCTGTTATGA 9060 

TGTTCGGTTT ACCTGCAGCA GCTTTAGCAA T T T AT C AC AC AGCTAAACCT GAAAATAAGA 912 0 

AAGTAGTAGC AGGTTTAATG GGTTCTGCTG CTTTAACATC ATTCTTAACT GGTATACAG 918 0 

AACCATTAGA ATTCTCATTC TTATTTGTAG CACCATTATT ATTCTTTATT CACGCaGTAC 92 4 0 
TTGATGGTTT ATCATTCTTA ACATTGTACT TATTAGATCT TCATCTAGGT TATACATTCT 9300

CAGGTGGTTT CATCGACTAC TTCTTACTCG GTATACTACC TAATAAGACA CAATGGTGGT 93 60 

TAGTCATTCC TGTAGGTCTT GTATACGCAG TTATTTACTA CTTCGTATTC CGATTCTTAA 94 2 0 

TTGTAAAATT AAAATACAAA ACACCAGGTC GTGAAGATAA ACAATCACAA GCGGCTACTG 94 8 0 

CTTCAGCAAC TGAATTACCA TATGCAGTAT TAGAAGCTAT GGGTGGCAAA GCAAACATTA 954 0 

AAC AT T TAG A CGCTTGTATC ACACGTCTAC GTGTTGB.GT T AAC G AC AAA TCTAAAGTTG 9600 

ATGTTCCTGG TTTGAAAGAT TTAGGCGCAT CTGGTGTATT AGAAGTCGGC AATAATATGC 9 660 

AAGCAATTTT TGGTCCTAAA TCTGACCAAA TCAAACATGA AATGCAACAG ATTATGAATG 97 2 0 

GTCAAGTAGT AGAAAATCCT ACTACTATGG AAGACGATAA AGACGAAACT GTTGTTGTTG 97 8 0 

CAGAAGATAA ATCTGCAACA AGCGAATTGA GCCATATCGT GCATGCACCA TTAACTGGTG 98 4 0 

AAGTAACACC AT TAT C AG AA GTGCCTGATC AAGTGTTCAG CGAAAAAATG ATGGGTGACG 9 900 

GTATCGCTAT CAAACCTTCA CAAGGTGAAG TTCGTGCACC ATTCAACGGT AAAG T AC AAA 9 9 60 

TGATTTTCCC AACAAAACAT GCAATTGGTC TTGTATCAGA TAGTGGTTTA GAACTATTAA 10 020 

TCCACATCGG TTTAGACACT GTTAAATTAA ACGGAGAAGG CTTTACTTTA CATGTTGAGG 10080 

AAGGTCAAGA AGTTAAACAA GGTGATTTAT TAATCAACTT TGATTTAGAC TACATCCGCA 10140 

ATCATGCAAA GAGTGATATT ACGCCTATTA TCGTGACACA AGGAACATT ACAAACCTTG 10200 

ATTTTAAACA AGGTGAACAT GGCAACATTT CATTTGGCGA TCAATTATTT GAAGCTAAAT 102 60 

AATGCTTACT AT AAACAGG T GCGTATACCT TCATAAGGTG ACGCGCCTGT TTTTTCTTTG 10320 

CTATTGTATT TTGCAGCATC ATTGATAGTT CGCTCTCCCC TTAAATTTTG AATTTTAAGA 10380 

TCATCAATTA AAGCCCCCCT TCATACTCAT TTCCTAAAAA ATATTAATTG TTCACTATTG 104 4 0 

TTAGCGTTTT CACAACAAAG TCAACTTCCT TGACCTTACA CTATATTCGA GGCTATCATT 10500 

TTAAGTGTAA ATATAGAGAA AAGGTGGCTT TTTTTATGAA ACAACGCATT GGAGCTTACT 10560 

TAATTGACGC TATTCATCGA GCAGGGSTCG ATAAAATTTT TGGTGTTCCT GGTGATTTTA 10 620 

ATCTCGCTTT TCTAGACGAT ATTATCAGCA ATCCCAATGT AGATTGGGTA GGAAATACAA 10 680 

AT G AAT T AAA CGCAAGTTAC GCAGCGGACG GTTATGCCCG TCTTAATGGA CTCGCTGCAT 107 4 0 

TAG T TACT AC ATTTGGTGTT GGCGAATTAA GTGCCGTCAA CGGTATCGCA GTTCATATG 10800 

CTGAACGCAT ACCTGTCATT GCGATTACAG GTGCGCCGAC ACGTGCTGTT GAACAAGGCG 108 60 

GTAAATATGT ACATCACTCA CTTGGTGAAG GTACATTTGA CGACTATCGA AAAATGTTTG 10 920 

CACATATAAC CGTTGCACAA GGTTATATCA CACCTGAAAA TGCAACAACC GAAATACCAC 10980 

GTTTAATTAA TACAGCAATC GCCGAAAGAC GCCCAGTTCA TTTACATTTA CCAATCGATG 1104 0 

TCGCAATCTC TGAAATTGAG ATACCGACAC CATTTGAAGT GACGGCAACT AAATATACGG 11100 






ATGCATCAAC ATATATAGAG TTATTAGCAA CTAAACTGCA TCAAGCGAAG CAGCCTATCA 11160 

TCATTACTGG ACATGAAATT AACAGTTTTC ACCTCCATCA AGAATTAGAA GATTTTGTAA 11220 

AT C AAAC AC A GATACCAGTA GCACAACTTT CAT T AGG AAA AGGTGCTTTT AATGAGGAAA 112 8 0 

ATCCATATTA TATGGGTATT TACGATGGGA AAATTGCCGA AGATAAAATA CGAGATTATG 11340 

TGGACAACAG CGATTTAATT TTAAATATTG GAGCCAAATT AAC AG AT T C A GCAACAGOA 11400 

GTTTTTCATA CCAATTCAAT ATCGATGATG TCGTTATGTT AAAT C AT C AC AATATCAAAA 114 60 

TTGACGATGT TACAAATGAT GAAATATCTC TACCATCATT GTTAAAACAG TTATCCAATA 11520 

TTTCATATAC GAATAACGCA ACGTTCCCTG CGTATCATCG TCCAACATCA CCCGATTATA 11580 

CTGTTGGCAC AGAAGCATTA ACACAACAAA CTTATTTTAA AATGATGCAA AATTTCTTAA 11640 

AACCAAATGA TGTCATCATT GCTGATCAAG GTACATCATT CTTTGGTGCT TATGATTTAG 117 00 

CAT T AT AC AA AAACAATACT TTTATAGGGC AACCGTTATG GGGTTCTATC GGCTATACAT 117 60 

TACCTGCAAC ATTAGGTTCA CAATTAGCAG ACAAAGATCGTCGTAACTTA TTATTAATTG 11820 

GTGATGGCTC ATTGCAACTA ACTGTTCAAG CTATTTCAAC TATGATTAGA CAGCATATTA 118 80 

AACCGGTATT ATTTGTGATT AATAATGACG GCTATACGGT AG AAC G AC T T ATTCACGGCA 11940 

TGTATGAACC TTATAATGAA ATTCACATGT GGGATTATAA AGCTTTACCA GCTGTATTTG 12000

GTGGTAAAAA TGTTGAAATT CATGACGTTG AAT CAT C AAA AGAT T T AC AA GACACGTTTA 12060 

ATGCAATTAA TGGTCATCCC GATGTGATGC ATTTTGTCGA AGTCAAAATG GCTGTCGAAG 12120 

aCGCACCGAA GAAACTCATC GATAtCGCTA AAGCTTTTTC ACAACAAAAT AAATAATTTC 12180 

ATCGTATACA GGGTATAAGT TTAAGCGAAT ACTTTATTAA ACGAATAGGA CTCTGATATA 12240 

AGAT GAT T AA TTTTAATAAA ACCGCTTTAG TGTTAATCGA CCTGCAAGAA GGTATTCTTA 12300 

AAATGGATTA TGCCCCATAT ACAGCTGAAA ATGTCGTTCA AAAC GC T AAT AAATTAATAG 12360 

ATGTTTTTAG AAAAAACAAT GGCTTTATCG CTTTTGTTCG CGTGAATTC TATGATGGTA 12 420 

AAGATGTATT GCAACCAAAT G T AAT GAT C T CATTACCACC AAAAGAAGGC GACGcACTAC 124 80 

AGTCGTTTCC ATCATTTATT AGACAAGAGA GATGACGATT T T G T CAT AG A CAAACGACAT 1254 0 

TTTAGTGCAT TTGTAGGAAC AG AT T T G G AC TTACAATTGC GACGTCGAGG AATTGATACG 12600 

ATTGTTCTTG GTGGTGTCGC AACGCATATT GGCGTAGATA CGACAGCGCG AGATGCCTAT 12 660 

CAATTAAACT AC AAT C AG T T TTTTGTTACA GATATGATGA GT GCACAAAA CGAAACGCTA 127 2 0 

CATCAATTTC CAATAGATAA TGTATTCCCA TTGATGGGAC AAAC AAT AAC T AC AAAC G AC 12780 

TTTCTAAATA TATTGAACTA AACATATACT TCCCCCCTTC GATCATGTTG AGGGGGATCT 12840 

TTATTTCACA AAGTATTAAT ACGTCGGGTT GTCTAACCTT CTATATTTAA CATATTCTAT 12 900 
ATCTGTTAAA TCGTTCTTAA CTTACGCCCC TACTACATAA AAAACAGTAT TTATTCCGGA 12960

ATTTTCAAAA AATTTAGTAT TTATTGCAAA ATTATGTATC ACTTTATGTT TAATTTTGA 13020 

TATTATCTTA ATTAAGTAGA TTTTTATAAG TTCTAAAAAG GAGAACAAAT ACATATATGA 13080 

AGAAGAAACT AACATTTAAA GAAAACATGT TTATAGGTTC TATGTTATTT GGTTTATTCT 1314 0 

TTGGTGCCGG CAATCTTATC TTCCCAATAC ACTTGGGTCA AGCTGCTGGT TCTAACGTTT 13200 

TTATCGCTAA CTTAGGATTT TTAATTACAG CAATTGGCTT ACCATTTCTA GGTATCATTG 13260 

CTATTGGCAT TTCAAAGACA TCTGGTTTAT TTGAAATTGC ATCGCGTGTT AATAAAACAT 13320 

ATGCTTACAT TTTCACGATT GCCTTATATC TAGTTATCGG ACCATTTTTC GCCTTACCTA 1338 0 

GACTGGCAAC GACATCATTT GAAATTGCAT TTTCGC&TT TTTATCACCA AAG C AAA T C A 13440 

CTTTATATTT ATTTATTTTT AGCTTCGTCT TCTTTGTGAT TGCATGGTTT TTTGCGAGAA 13500 

AGCCATCAAG AATTTTAGAA TATATCGGTA AATTTTTAAA TCCGGTATTC T TAG TAT TAT 13560 

TAGCAATTAT TTTATTATTT GCTTTTATCC ATCCATTAGG TGGCATATCT GATGCACCTA 13 620 

TTAGTAAACA ATATCAATCA CATGCCTTAT TTAACGGCTT TTTAGATGGA TACAATACCT 13680 

TAGATGCGCT AGCGTCATTG GCATTTGGTA TTATCATTGT TGCAACGATT AAAAAGTTAG 137 4 0 

GTATCGAAAA TCCAACTGAT ATCGCTAAAG AAACAATTAA GTCTGGTACT ATCAGTATCA 138 0 0 

TTATGATGGG GATCATTTAT ACCCTACTAG CAATCATGGG TACATTAAGT ATTGGTCATT 138 60 

T C AAAC T TAG TGAAAATGGT GGTATTGCCT TAGCGCAAAT TACTCAATAC TACTTAGGTA 13 920 

ACTACGGTAT CGTCCTGTTG TCACTTATCG TTATGGTTGC TTGTTTAAAA AC AG C CAT C G 13980 

GTTTGATTAC GGCATTTTCA G AAAC AT T C G AACACCTTTT CCCfflAAATG AATTACCTAG 14 040 

CGATTGCAAC AGTTGTAAGC TTTATTTCGT TCTTATTCGC GAATGTTGGT TTAACTAAGA 14100 

TTATTATGTA CTCAGTCCCA GTGTTAATGT TCTTATATCC ATTAGCAATT GCCTTGATTG 14160 

TACTAACATT ATTTAGTAGC AAATTCCATC ATTCAAAACT TATTTATCAA TGTACCATTT 14 220 

TCTTTACAAT GATTGCTGCA TTAGTAGATG GATTAAAAGC TAGTCCAGAG TTCATTTCAA 14 28 0 

GCACATCATT C T C AC AAAC T TTGATTAATT TCAGCCAAAA ATATTTACCA T T AT C AG AC A 14 34 0 

TTGGTATGGG CTGGGTTGTT CTCAGTTTGA TTGGTTTCAT TATCGGCTTC ATTATTTATA 14 4 00 

AAATTAAGCA TCGTAAAATT CCACA7GCAT AATACTATGC CACAGTCATA TGTTAAACAT 14 4 60 

ATGCTTGTGG CATTTTTTAT TCATACTACA TTAAACTGCA ATCGTATACA T AC AT AT C AA 14520 

TGATTATCCA CAAAAAATAT TAGTACTTTC ATT TT AC AAA TCACATTAAT ACAAACACAA 14580 

CCTTATCTTT ATATTATTAA ATTTATATTT GACACTTATA TTGAACAACT EAATATATT 14 640 

AATATTAATT CTTTAAAATG TATAAATATA AAGGAGGGAG AC C GAT G a AT TCAATCATTG 14 7 00 

aATTAACTGA TTATTATAGC TCTAATAATT ATGCACCACT TAAGCTTGTC ATTTCTAAAG 147 60 
GTAAAGGTGT CAAAGTTTGG GATACTGATG GCAAACAATA TATAGATTGC ATTTCGGGTT 14820

TTTCAGTTGC AAACCAAGGC CATTGTCATC CAACAATTGT TAAAGCGATG ACAGAACAAG 14 88 0 

CTTCAAAGTT GTCTATCATT TCACGTGTCC TTTATAGTGA CAATCTCGGG AAATGGGAAG 14 940 

AAAAAATTTG TCATCTTGCT AAGAAAGACA AAGTACTCCC CCTTAACTCT GGTACTGAAG 15 000 

CTGTTGAAGC AGCCATTAAA ATTGCTAGAA AATSGGGCTC TGAAGTTAAA GGCATTACTG 15060 

ACGGACAAGT TGAAATCATC GCTATGAATA ACAATTTTCA CGGTCGTACA CTTGGCTCAT 1512 0 

TATCACTATC TAACCACGAC GCATATAAAG CAGGATTTCA CCCCCTACTT CAAGGCACTA 1518 0 

CAACAGTAGA TTTTGGAGAC ATTGAACAAT TAACACAAGC TATTTCACCG AATACAGGA 152 4 0 

CAATTATTTT GGAACCAATT CAAGGTGAAG GTGGCGTTAA TATACCACCG AAAGGATATA 15 300 

TTCAAGCTGT GCGTCAACTA TGTGATAAAC ATCAAATATT ATTGATTGCA GATGAAATTC 15 360 

AAGTTGGTCT TGGTAGAACT GGGAAATGGT TTGCTATGGa ATGGGAGCAA GTCGTTCCAG 15 42 0 

ACATTTATAT TTTAG3TAAG GCATTGGGTG GCGGCTTATA CCCTGTATCT GCTGTACTTG 154 80 

CAAATAATGA TGTCATGCGT GTTCTAACAC CAGGTACACA TGGTTCAACA TTTGGTGGTA 1554 0 

ACCCTTTAGC CATTGCAATA TCGACGGCAG CGCTTGATGT ACTTAAAGAT GAACAACTGG 15600 

TTGAACGATC AGAACGCTTA GGTTCATTTT TATTAAAAGCGTTGCTACAA CTTAAACATC 15660 

CTAGTATTAA AG AAAT TAG A GGTCGTGGTT TATTTATAGG CATAGAGCTT AACACAGATG 1572 0 

CTGCACCTTT TGTGGATCAA CTGATTCAAC GTGGAATCTT ATGCAAAGAC ACGCATCGTA 157 8 0 

CTATCATTCG ATTGTCTCCA CCTCTAGTCA TTGATAAAGA GGAAATCCAT CAAATTGTTG 15840

CAGCTTTTCA AGACGTTTTT AAAAATTAAC AATTAATCAT TTATATATGA CATAGGAGGG 15 900 

ATTCATGATG ATTAAAGTAG GTATCGTTGG CGGTAgcGGT TATGGCGCAA TTGAATTAAT 15 960 

TCGATTGTTA C AAAC AC AT C CTCATGTAAC GAT TGCAC AC AT C T AC T C AC ATTCAAAAGT 1602 0 

AGATGAACCG TTGAAATTAA CATTTCCACA TTTACAACAT ATTATGCAAC ATTTCGAAGC 16080 

ACTTACAGTG GaTAATAATG ACTGTGATGT AATTTTCTTT GCGACACCAG CACCCGTAAG 16140

TAAAACATGT ATCCCTCCCT TAGTAGAAAA AGGTATTCAT GTTATCGATT TATCTGGCGC 16200 

ATTTAGAATT AAGAATCGTG AAATATATGA AGCATATTAC AAAGAAACDG CTGCAGCACA 16260

AGATGATTTG AATCATGCTA TTTACAGCAT TTCAGAATGG CAATCGTTTG ATAACAATGG 16320 

AACGAAGCTC ATTTCTAATC CTGGCTGTTT CCCTACAGCA ACATTATTAG CATTACATCC 16380 

ACTTATTAGC GAAAAAATAG TAGATTTGTC ATCTATTATT ATTGATGCTA AGACCGGCGT 164 4 0 

GTCAGGTGCT GGTCGTTCAT TATCACAACG gTTcATTTTT C AG AAAT G AA TGAAAATCTA 16500 

AGCGCTTATG CAATCGGAAA C CAT AAAC AC AAACCGGAAA TCGAGCAATA TTTATCTATC 16560 
ATTGCGGGTC AAGATGTATC AGTCATATTT ACACCACATC TCGTACCAAT GACACGAGGT 16620

ATTTTATCAA CAATATATGT CAAATTATCA TCTGAATATA CGACTGAATC ATT AC AT AAA 16680 

TTAATGACCT CTTATTATGC TAATCAGCCA TTTGTCAGAA T TAG AG AT AT TGGGACTTTT 16740 

CCAACCACAA AAGAAG T AC T CGGTAGTAAC TACTGCGATA TCGGCATCTA TGTAGATGAA 16800 

ACAACGCAAA CAGCAATTTT AGTATCAGTG ATTGATAACC TTGTCAAAGG CGCA&TGGG 168 60 

CAAGCCATTC AAAATTTAAA TATATTATAT GATTTTGAAG TGACGACTGG CCTAAATCAA 16920 

TCACCAGTTT ATCCATAAGG GGTGTTAGAA TGAAACATCA AGAAACGACA TCACAACAAT 1698 0 

ATAACTTTTC AATTATTAAA CATGGCGATA TCAGTACACC TCAAGGCTTC ACGGCTGGTG 17 040 

GTATGCACAT CGGTTTACGC GCTAACAAAA AAGACTTTGG GTGGATTTAC TCATCGTCTT 17100 

TGGCAAGTGC AGCTGCCGTA TATACTTTAA AT C AG T T T AA AGCTGCACCA CTTATTGTCA 17160 

CTGAAGACAC TTTACAAAAG TCTAAAGGAA AATTACAAGC ACTTGTTGTT AATTCAGCTA 17220 

ATGCAAATTC TTGTACCGGT CAACAAGGCA TAGATGAGC AC G AC AAAC A CAAACATGGG 17280 

TTGCTCAACA ACTTCAAATA CCATCTGAGC ATGTTGCTGT TGCTTCAACT GGGGTCATTG 17 340 

GTGAATATTT GCCTATGGAT AAAATTAAGA CTGGGACCGA AC AT AT T AAG GATGCTAATT 17 4 00 

TTGCAACGCC AGGTGCGTTT AAC GAGGCAA TTTTAACAAC TGATACCTGT ACAAAACATA 174 60 

TCGCTGTATC AC T AAAAAT C GATGGTAAAA CCGTTACAAT TGGTGGTAGC ACCAAAGGTT 17520 

CAGGTATGAT TCACCCAAAT ATGGCTACCA TGCTTGCTTT TAT AAC AAC C GATGCATCGA 17 580 

TTGAATCGAA TACACTTCAT CAATTATTAA AATCTTCGAC TGACCATACA TTTAATATGA 17 640 

TTACTGTTGA TGGCGATACA AGTACAAATG ACATGGTATT AGTCATGGCA AATCACCAAG 17 7 00 

TTGAACACCA AATACTTAGT C AAG AC CAT C CACAATGGGA AACATTTGTT GATGCATTCA 177 60 

ATTTTGTCTG TACATTTTTA GCTAAAGCTA TAGCCAGAGA TGGCGAAGGC GCAACAAAGT 17 820 

TAATCAGCGT CAATGTGTCA GGCGCTAAAT CAATATCAGA TGCAGAAAA ATCGGTAAAA 17 880 

CTATCGTAAG TTCAAATCTA GTAAAATCAG CTATTTTTGG CGAAGATGCC AATTTTGGTC 17 94 0 

GAATCATTAC AGCTATTGGC TACAGCGGAT GTGAAATTGA TCCTAACTGC ACATATGTTC 18 000 

AACTGAACCA AATACCTGTC GTTGATAAAG GTATGGCTGT ACTATTTGAT GAGCAAGCTA 18060 

TGTCGAATAC ATTAACTCAT GAAAATGTCA CAATTGACGT TCAGCTTGGT TTAGGTAACG 18120 

CTGCAGCGAC TGCATACGGT TGTGATTTAT CCTATGATTA TGTGCGTATC AACGCATCAT 18180 

ATCGAACATA AGGTGGTGTT GGTTAGATGA AATTTATTGT CATTAAAATT GGTGGCAGTA 18240 

CACTTAGTGA CATGCATCCA T C AAT TAT T A ACAACATTAA GCATTTACGA T C AAAC AAC A 18 30 0 

TCTACCCCAT TATCGTTCAT GGCGGTGGCC CATTTATTAA TGAAGCATTA TCAAACCAGC 18360 

AAATCGAGCC ACACTTTGTT AATGGCCTAA GAGTGACT G A TAAAGCAACC AT G AC CAT T A 184 20 
CTAAACACAC GCTCATTGCA GACGTTAACA CTGCATTAGT AGCTCAATTT ACCAGCACC 18480

AATGTTCTGC AATAGGCTTA TGTGGTTTGG ATGCACAGCT GTTTGAAATT ACATCTTTTG 1854 0 

ATCAACAATA TGGATATGTC GGTGTTCCGA CCGCTTTAAA TAAGGATGCT TTACAGTATT 18 600 

TATGTACTAA ATTTGTACCT ATCATCAATT CGATTGGTTT CAATAACCAT GATGGAGAAT 18660 

TTTACAATAT TAATGCTGAC ACGCTTGCCT ATTTTATTGC ATCATCATTA AAAGCGCCTA 18720 

TTTATGTATT AAGTAATATT GCAGGTGTAC T CAT C AAT G A TGTTGTTATA CCTCAATTGC 18780 

CATTAGTCGA TATTCATCAA TATATTGAAC ATGGTGATAT TTATGGAGGT ATGATTCCCA 18840 

AAGTGCTAGA TGCCAAAAAT GCGATTGAAA ATfflCTGTCC TAAAGTTATC ATTGCATCAG 18 900 

GAAACAAGCC AAATATCATT GAATCTATTT ACAATAATGA TTTTGTTGGC ACAACAATCC 18 960 

TTAATTCATA AC TAT GAAAT TAAGGCCTAA CAAGTTTTGA CACGCGAGAT GATTCCAGTT 19020 

CGATTATCCA TTGCGCTAAA ACATTTATTT ACCGTTCATC TCGTTAACAA TTTTGAATA 1908 0 

AGTACGATAC AAT AT GAG AT GTAAAAAACT AATAACCTTT TACAAATTTG TTTATCAAAA 1914 0 

TATTTTAAGT TTTGCAAAGC TTTTTATTGT GATTATTTTC ACAAAATACT AT AAT GAG GA 19200 

TAGTAAATAG AGAGGAGTCC TTAAGTTGAC GAAACGACAA ATGGGTATAT TCATTTATGC 192 60 

TGGAATTATC GGTGGCTTGT TATCTGGAAT TGTAAAATTA GGTTGGGAGG TCATGTTTCC 19320 

ACCTCGCACA CCAGAACGTA ATGCAACGAA CCCACCTCAA GAGTTATTGC AACAATTAGG 19380 

ATTTAGTAGT GAGTTTACGC ATCAAACATA TACATTTTCA AATATGGAAT TGCCTTGGGT 19440 

AAGCTTTATT GTCCACTTTA GTTTTTCTAT CGTCATTGCAATTATTTACT GCATATTAGT 19500 

TAAAAAATAC GCTTACTTAG CAATGGGACA AGGTGCTGTT TTTGGTATTG CTATTTGGGT 195 60 

ATTATTCCAC CTTATCATTA TGCCAATCAT GCATACTGTA CCTGCTGTGT GGGATCAACC 19 620 

ATTCCAAGAG CATCTGTCAG AATTCTTTGG CCATATCGTC TGGATGATGA CAATTGAATT 19680

AGTGCGACAA CATTTTGTCT ATCGCTATAA ATTAAATTAA TACACTGACT AACATTAACG 19740 

TGAGTTTt AA ATCATCGTTT GAGTATGATG ATTGATGCTC ACGTTATTTT ATTAACTGAC 198 00 

ATGATATGAT TCCAGCCAAC TTACGTGAGC ATTAAAGTCT CAAATGCGTC GTAACAAACT 198 60 

ATTATTTTCG GTAATTTCAA TflTTGCTCAG TATATTTTTA CCTTATCACT TACTTTAATC 19920 

TCGTCATGAT TTTGAATGAT GCCATCGTGT ATTCACCTTT CATTTTTCCA ATAAAAAAAC 19980 

ATCTAACAGT AAAC AT T T AG GCAGTATAGT TTAAAATCAC TGCGCAATGA TACTGTCAGA 2004 0 

CGTCATATTA ACTACTCAAT AACTGAAATA CAGACACTTT TTTATAACC CAGGGTGCCT 2 0100 

GTCCTAAGAA ACATACCTGT ACCATAAACT GATCAAAAAT AAATTGTTTG AACTTCACTT 20160 

CACGTGATTG ATAAAAGTGT GATTGTGTCA TATCATAAAT GTCCAATCCT TTGATTAAAC 20220 
CTTCACCAAT CAATTTTGTA AAACTTTCTT TTTGTGTCCA TATTTGATAA AAATCATTTA 20280

AACTACATAT TTGATGTGCT TCGTTTGTAG AGAAACACGT CACTAACGTA CGCCAGTCTA 2 0340 

AACGTTGTGA TATCTTTTCG ATATCAATAC CAACTGGTTC TTTATCGACA ACACACACGA 204 00 

TATAAGGATA ACTATATGAT AAGCTCACAT AGATGGGCTG TCCATCACGA TTGTGTTGAA 204 60 

CAATATCTGC CTTACCTCGT GGCGAAATGT GATAATGCCA TTCATGTGGT AATAAACCTG 20520 

TGTCATGTTG AATTCCATAT TGCACTAAAA TATCTCCCAA TCTGTGCATG AGTTTATCTT 20580 

GATTGTATCT ATAGTTGACT GTACGCGGTT TTTTATATGA CCAACGACTT TGTGATATTA 2064 0 

ATTCTTCAAT ACTTTTCAAG TTACTCTGTA ATTGCATTAC AAATACTGTC ATAAETTCC 2 0700 

CTACTTACTT ATTGAATATT GTTTTGATAT ATTGTGCCCA ATGATACAGC CAATTGTTAG 207 60 

TTATCGTTGG CCATTTTTCA CTGATGTGAT TCATTATTTT TAATGTTAAT GTTGTATCTA 20820 

TCATTGCTAG TTGTTGTTCA CGGTCAACAC TAGTTAATCC AATCGTTTCG TACATGTCTT 20880 

GTTTCTGTAA AATTTCATTA AATGATTCAT CGCTGACGAG TTCAATTTCT TTGCGCTTAA 20940 

CGCATTCTAA CAAAGATTTC ACCGGCATTT TATTAGGTGA TAGCACATGG T AAAT GAT TT 21000 

GTGGTGTGTT GACCTGTGCT AATGCGACAA TTTGTCTTGC AGTCGTATCC ACAAAAGAAA 210 60 

AATCTACAGG CATTTCAGCC ATGCTAACCC CGATACi&TC CAGTTGTAAC AAATCATTCA 21120 

TTACCATTGA AAAACGGTTA GTCTTTATAT TTCTCATATG CCATCTTCCA TTGTAAGGAT 21180 

TCGTCAAATT ACCAACACGT ACAATCCGAC CATCTAAGCC ATTATTTACA GCTTCTAATA 212 4 0 

CTTTTAATTC ACTATAAAAT TTGCTCCGTG TATATGGTGA TGTTAGTAGT TGCCCTTTAT 21300 

AGACATCCGC TTsTGAAAAT GTCACATCTT CTGTGTCTAT ATCAAAATAA GTTCCCACAC 213 60 

TTATCGTAGA CACATATATT AACCTTGCAT GATGTTGTTG TGCCAAACGT AT G AC AT C AA 214 20 

CAGTACCTTG AACATTTACT TTTTCAAATT CATCATCATC ACCAAAGTGA TCTGTACGAG 214 8 0 

CACCTGCATG AATAATCGTA TCCATGTTTT CTGGTAAAAC AACATCATCC ATACACTCGA 21540 

AATCACCAAC AATGACTTCA ATGTTTGATA ACATTATTTC AACCGTCTCT TCTGAAAAAT 21600 

AATCATTTAA ATTCGTCATC AACTTATACC ATGCTATTTC CTCATTATCA GCACGTATGA 21660 

AACAATAAAT GCGATGACTG TATCCTTGTA GTACTTCAAT CAGSAAGCA CCTAAAAAAC 21720 

CTGTCGCGCC AGTCAATAGT GTATTTCCTA GAGGTCGATG ACTTAGACTA TCCTCTAAAA 217 80 

TACCCAAGTT ATAACGAGAC ATAACAATCT TTTGTAATTC CGAAAGATTA TCCGGTAATG 218 4 0 

CAACTAATGA TTGTTGATTT TGGTACATAT AATTAACAAT CTGTCGCACG GTTTTATATT 21900 

GGTATAATGT CTGCATTGAA ATATGATGGC CAAATCGTTT TAAATGCGAG ACAACTAACA 21960 

TCGCCTCTAA TGAGTTACCA CCAAGTTCAA AGAAATCATC GTCAACACCG ACATCATTTT 2 2 020 

GTTTCAATAC CTCTCCAAAT ACATCAACAA ATGTCTGCTC AATTTCATTA GAGGGTTCGC 22080 
TATACACTTT ATTAGACTGT TGTATA-5GTG ATGGATTTGG CAAACGCGTA GTATCCACCT 22140

TGTCATTCGT GGTTAATGGC ATACAATCGA TATGCGTTAT AGTCTTAGGA ATCATATACT 22200 

TAGGCAGCTG AT CAT T T AAA TATTGCTTCA AATCCTGTTC CACTTGTTGC TCTCCGACAT 222 60 

AATAAGCATT CAATATATCA TGCGTATCAA AGTGACTTAC TGTTACAACA EAT C AG AT A 22 320 

TACCACGAAT AGCTAATATT GCATTTTCAA TTTCATCAAG TTCAATACGG TACCCGTTAA 22380 

CTTTCACTTG TTTATCTATT CTTCCTAAAA ATTCAATTTG ACCATCAGAT GTATAACGTG 224 4 0 

CTAAATCACC ACTATGATAC AACTTTCCTT TACCAAATGG ATTATTTTGC CAT T TAT C AG 22500 

CCATTAATTC TGGACGATTA ATATATCCTA TCGCTAAACT ATCACCTGCA ATACACAACT 225 6 0 

CGCCTGGCAT ACCAATACCG CATAACAAAC CATCTGACAT AATATACACT TGGATGTTAG 22 620 

ATAAGGGTTT GCCAATTGGA ATCGTCTCAG GTATCAAATC ACCACAATGA TGTGACCAAT 22 68 0 

ACGATGTGAT GACTGTTGAC TCAGATGGTC CA'AGGCATT GAAATACGTG CCACAATGCT 227 4 0 

TCTCAATATA TTTAACAAAG GATGCCGTAC TAGTTGCCCC GCCTGTAATC AACTTTTCAA 22 8 00 

TATAAAAGTC TTCCATAACA CTACACATCT GTAACGGAAT CGACGCAACC GTCACACGAT 22 8 60 

GCTTATTAAT GAGTTGTTGT AACTGTTCTG GATTAACACG TTCCTCTCTA TCTGGAAT5C 22920 

CAAGCGTATG ACCATTTAAC AAACAACAAT AAATCTCCAT AACTGATGCA TCAAAAACAA 22 98 0 

TATTTGCATG TTGCAAAAAT ACTTCATTGT CGCCTAATTG CAATTCAGTT GACCATGCAT 230 4 0 

GCACTAAATT CAACAAATTT CGTTGTCGTA TGGCAACCCC TTTAGGCATC CCGGTCGTAC 2 3100 

CAGATGTGTA AATAGCATAC ATCTCATTAT CTAACATCGC TGTGTTTTCA AGTTGATTGC 23160 

CATGTAAATC ATCATATTGT TCATTTTCCT TTGATTCAAC AAAGCCTTTA GCATTTTCCA 23220 

ATTGAATGAC TGGTAATGTC GTTTCAATTT CAACGCCGTA AGACATGACT GCAGTTACTT 2 3280 

TAGCATCCTC CAAAATTGCA CCTTGTCGTT TATTCGGAAAATCAATATCG ATAGGTATAT 2334 0 

AAGATGCACC TACTTTAACT GTCGCCAACA TCGCCGCAAT CATTTCAAAA CTACGTTCTG 234 00 

TAAACAAGGC AACCCGTTGA CCATTGCCCA CACCATTTGA TAGGAGCATG TGCGCAATGG 2 34 60 

CATCCACATA GTTGCGTAAT GTTTCATACG TCATTGTCAA ATCATTCATG ACTAGCGCAA 23520

CATGATTACC TTGTCGTGAG ACAACTTCAT TAAAGTAACT TATGATAGAT TTATTTCCCG 23580 

GGACATTAAG CATTCGATCG TTAACATGCG TATTGACCCA ATTTAGAAGT TCCTCCGTGC 2 3640 

CGTTTGGTAT ATCACAAATT TGTAGTGTAT CTTGATGCTT CAAAATATAA TCAAT CAT AA 2 3700 

TCATACATTG ATTACCCATG TGACGAACTG TTTCTGAGTG ATATAAATCG G TAT TAT ACT 23760 

C GAT AT T GAT TGTATAGTCA TCGCGATCTT CTTCAATGAT GAAAGATAAA TCAAATTTCG 2 382 0 

CCGTCACTGA TTTGGGTTGA ATGTGTGTTA AT T T AC TAT G CCCAAAATGA GCATGATTCG 2 3880 
TTTCATTGTT TTGTAGTACT AACATGACAT CAAATAATGG ATTCCGTBG GCATCATGTG 23940

ATTGATCTAA GTCATTTACT AAACATTCGA ATGGGTATTC TTGATGCTCG TATGCCTCCA 24000 

AACTCATTTC CTTAACCTCT TGTAAAAACT GTGTCCACAT TTTATCAGGT GACGGTTGCC 24060 

CTCTATATAC CAACGTATTA GCAAAC AT GC CTAGCATTTG CTCCGTGCCT TTATGCATAC 2 4120 

GCGCACTCAT CACACTACCG ACAACAACAT CATCTTTTCG AGCATATCTA CTTAACAACG 24180 

T C AT G AC C AC AC T C AT AAAG AACATAAAAT CAGTAATTTG ATGCTTTTCT ACATACTTTT 2 4240 

GAAGTAGCTG TCTCATTTGT TGATTCATTG T AAAT G AC AT CATTGCTCCA TTTGTCGTTT 2 4 300 

TAATATTTGG TCTAACATAG TCTGTCGGTA AGCTTAAAAT AGGTACTTCA TCTTTGAATT 24 360 

GAGATAACCA ATATTGTCTA TGTTTCGTCA TATCACGATG CGACATCCAC TCACTATAGT 24 4 20 

CTTTATATTG CAATTTAAGT GGTAACAATA ATTTATGTTG ATAAAGTGCG T T AAG AT CAT 24 4 80 

TCATTAATTG TATATTACTC ATACCGTCAT TAATGATATG ATGCGTATCT ATAA&AGGT 2 4 540 

ATGCATGTAA GGGACTTCTA ATGTATCTCA CTCTAATTTG ACTTGGCTTT TCCAAATTAA 2 4 600 

AAGGTGCTAC AAATTGGCGC ATGATTTCTT GTTCATCCGT AAAATGCGTG TTAACTTCTT 2 4 660 

CAAAGTCAAC TGCAACATCT GCCACAATAC GTTGTCGAAC CTCATCATCT ACAACAATAT 24720 

ATTGTGTTCG TAAAATCTCA TGTCGCGCTA TCAAACGCTG CACTGCTTGT CGCAATTGAG 24780 

CT AC ATT AAG TTCTGATGAT AACCGCCATA AAAAAGGTAC GTTATACACC GTATCTTTAT 24 8 40 

GGTTTGATTT CCATAATAAA TACATACGCT TTTGTGCAGA GCTCAGCACA TAATCATCTT 2 4 900 

TAACTATAGT TTCTGGAATC ACTTCATAGT TTTGTTCTTG AACCTTAGCA ATCGCTTGTG 24 960 

CTAGTTCAAA TACAGTTGGC TTTTGTAATA AATCACCAAT TTGTAATCGT TTCCCAGTAG 25020 

ATGCCTCTAT CCGATTCACC ACTAACGTTG CTTTTAATGA ATGGCCACCT AATTCAAAGA 25080 

AATTATCATG AATACCTACT T GAT T C AC AT GTAAAATATC TGCAAAAATT TGGCATAGCA 2514 0 

AGTGTTCGGT ATCTGTACTC GGTGCTACAT AGGCATCCGT ATCGACATAG T C C AT GAT AG 2 52 00 

GCAATGCCTT CTTATCTAAT TTCCCATTAA TAGTAATAGG AATTTGCTCA ATATGCATGA 2 52 60 

AATTAACTGG TATCATGTAC TCCGGTAAGG TCATACGTAA TTGTGATTTA ATCTTATTAT 25320 

GTGATAATGT ATGCATCGCT TCATAATAAG CAACGATATA CTGATCTTGA TCATGATTTT 25380 

GAACAATAAC AACTGCTTTA TTAATACCTT GTATACGCTC GAGCGCATGC TCAACCTCTG 2 5440 

ACAACTCAAT CCTAAACCCT CGAATCTTAA CTTGTTTGTC CTTTCGATAT AAATAATCTA 25500 

TGTTGCCATC GGGTAACAAA CGAACGATAT CACCACTTCT ATAfiTCAGC TGaTTTATAT 25560 

TTGAATCTTT GATAAATTTA TCTGCTGTCA ATTCTGGCTG ATTTAAATAA CCTGCAGCTA 25620 

ACCCAAAGCC AC T T GT AC AT AATTCTCCAG GAATACCAAC GCCACACCGA CGCTCGCCTT 2 5 680 

GCATGATATA AACATGAGTA CCCAGAATCG GTTTACCAAT AGGAATACGA TTTGGAACTT 2 5740 
TGTTAGGTAT ATTATACGTC GTTGTAAATG TTGTATTTTC AGTTGGTCCA TAACCATTAA 25800

TAATTTGAGG ATGCTTCGGT TTTTGATTAA GCAAATCCAC CCACTTAGCA TTCAATACTT 258 60 

CTCCACCAAT TAATAAATAC TTTAACGGTA CCAATACTTC TATTCGTTCA CTAGCAATCT 2 5 920 

GATTAAATAA TGAGGAGGTT AACCACATAG TATTAACGTC ATTTTCATTG ATTAATTGTT 2 5 980 

CTACCGCTAT TGGATTTAAT AATTGTTCTT TTTTAGCAAC AATCAGCTTT CCACCATTGA 2 6040 

GCAATGCACC ATATATTTCA AATGTTGCAG CATCAAAGGC TATAGTTCCT GATAACAAAA 2 6100 

TCGTCGTCTC TTCATTTAAT GGTACATAAT GATTTTGATG GACCAAGCGA ©AATACCTC 26160 

GGTGCGGAAT TAGTGTCCCT TTAGGGTTAC CAGTTGTCCC CGACGTGTAA ATAACATAAG 26220 

CATGATCTTC TAACGTGTTA CAT T TAG AAA GATTATCAAT ATTTTTCCAC GCTATCTTAT 2 6280 

TCAAATCAAT GTGATTAATA TTTTGTTTAC CATTTTCATA TAAAGCTTGG TACGTTATTA 2 6340 

CAACTTTAGG CGTTACATCT TTTAAAATGT ACTCCTGACG ATCACTTGGA TAGTTCGGAT 2 64 00 

CAATTGGCAC GTAAGCCCCA CCAGCTTTCA ACACACCTAT CATCGCTATT ATCATCTCAA 2 64 60 

TACTTTTTTC AGCTATGACA GCGACACGAT CATTAGGTTC AAC AC CAT AC TGGTTTCTCA 2 6520 

AACGGTGTGC TAAATCATTC GCGCGTGCAT TCmTGTTTG ATATGTTATA AACAcTCCGT 26580 

CAAATTGCAC AGCGACATGA TTCGGCGTTG CTTCAACTTG TTGCTCAAAT AAGGTAACAA 2 6640 

CTGTTTGCGC ATCATCTATC TCAGGCAAAC TTAAATTGAT ATCGTCATAT AAT TGAAT AT 2 67 00 

CACGTTCTGT CATCAAATTA AGTTCATCTA CAGTTGTTCG TTTATTTCCA TTTTCTTOA 2 67 60 

TAATTTGCAA ATAAATATTT CGAACTAAGT CACTCAGCGT CTCGATTGAG AGCAAATCAT 2 6820 

ATGCTGCACT GTTATATATA ATTTTGAACC CGTGTTGATG TGGAAAAATT TCAATATCCG 2 68 80 

CTAAAGATGT ATGTGCATCA TGTATTTGAT GTACATCCTC AAT AAC AT C A TTACAACAAG 2 6940 

ACATCATATG ATGATAACAA T GAAAAAT AG TCTCTAGTGA AAGCGAAGAC TTCGCGCACT 27000 

GTAATTGCGA CATATTTTGC AACACACATT TATTAAAATC TGTTGTAAAA CGTTGACATA 27060 

CATCTTTTGC ATCGATTGTT AACGTTAACG GCACAATATT TCCGTGTAAA TCATTTGGTA 2 7120 

AATGTGATGG TACATGTATA CCTAATGTGA CATCATGTTGTTGACTCATT ATATGATTAG 2718 0 

CTAAATACAC ACTAACAGCC AACGATGCCA TATCTATTGA TGTCATATCA TCAATCAAAT 2 7240 

ACGTTTGATA TAAAGCTTGT TCAAATGGAT GCTTAATTGG AAAAT AAC T A TCAATATGGA 27300 

TGTCAGAGTT ATTCTCTAAC CGAAAATAGT TTGAGTCTAA TGCTATATGC GATGCATCTT 27360

GATTGTCTTT AT CAT CAT T T C TAT T TAT AT GTGCATGCTG TCGAGTATTG T T AAT AAC AG 274 20 

TATTGCCACG ATATGCATTG CATAAATCAT CAAGAAnAAT ATCAATTTGA CTATCATCga 274 80 

AAATGGaCAC AT GAAAAT C T aATAGTATAT ATGcAGCATC AG C G AAC T Gm AACAATTTAA 27 54 0 
CTTTGAATAA AGGTGAATCA TTAAAATGGT AAGTACTTAA TTCTTGCTTA AAAAAAGCTT 27600

CTAAATCATA GTTTGCGGAA GAAGATGGAA CTTGTTTTAT CTCAATAAAA GGCAGAAATT 27 660 

CATGAAGTAT CATTGTTAAA TTGTCATCGG TAGTAACATC AAAAAAATGT CTTATAGATG 27720 

CATGTTGTgC ACAATTGTCG ATAATGCATA CATCATTTTA GTAGCTTfiA CATTTTTAGC 27780 

GAGTTTAACC CAATACGCAT TACGGTGTGT CGTTGATTCT GTATTATTTT TGTATATACG 2784 0 

AAAATATTCC TGTTGAAATC TCAAATTACC CATAATCATA AAAAGTCCTT CTTTCATATC 27900 

ATAATACTCA TTACTTACTG AAATTGCATG ATGATATGAT AACCGACGAA ATGTTAATTA 27 960 

ACTCGTTATG TAATGaTTAA TATaAAACAC CATTCGCAAC ATATGAGCGA TATATTCTAC 2 8 020 

CCTAAAATAC ATCTTGTATC ATCGTTACAA TTGGTATATT TTTCAATGTA AAT T AC AT AC 28080 

ATCTTCGATA AATAGCACAC TACAAATCGT TAATCACTTT CTGTTGTTCA CATCTCATTG 2814 0 

CAAACTCAAT ATTGTTGTTA CAAAATATCC ATGAAGCAAG TTTATATTAA ACAAACAACT 28200 

CGCATAAAAC AATTGTTATC CTTAAATTTT AACAAATTCT TAATAAATTT ATCTCTATTT 28260 

TAATTACGAC CAAATTAATA GGTTTTCCAT ATAAAAAGAT GCATAAAATA AATATTTAAA 2 8 320 

TAAATTCAAT TTGTATTCAC TTGTTTTTGT CCCCCAAATA CACCAGCAAC AAGCAGCTA 2 8 380 

GCACCAATTG TTAAAACGAT AAACATATAC AGTCCCATTT GTAATGACGT TAAGAAAACA 2844 0 

CCCAACACAA TCCCTAACCT AGCTAGTGTT TCTGAAAAAT GAATACCTAA TGCATTAACT 28500 

GCACTATATG TTCCTCTTTT AGCTTTAGGA ATAATTTTAA AGCGTTGTTC TGAAACTATA 28560 

GGCGAATAAA TAATTTCACC TACAGTCGCA ATTATCATAA AAACAACTAA TAAGCCAAAC 2 8 620 

TGATTAAGAT AGGTTAGACC ACTATAGCCA ACAATATAAA TCAGCAAACC AGTGATTAAA 2 8 680 

GCTTTTTTAA AATCTATTTT CAATACAACT TTCGAGATTG AAT AC G T GAG TAaAATGACG 2874 0 

ACGACCGTAT T AAT CAT TAG CAAGATTGCT AACATCTTAG CACCTGTAAT AT CAT AT G AA 28 8 00 

CCTATACTTA TTGTTTCAAA CTGATCCTTT AG T C T AAT AG C AAT AT AT G A GGAGATTGAA 288 60 

AATTCACCCA TCAT GAT GAT ACTGAACCCC GAAAT C AAT A ACATATAATT ACGGTCTTTC 2 8 920 

AAAACTAATT TATAACTGCG AAATATATTC ATTATTTGTA ATTTTTGATA ACGACTTGCA 28 980 

TGCCTCTTGT CATCACTTTG CTTTACTTGA TTTCGGTCTT GAGGTAACCA AATATATAAA 29040 

ATAAAGAGTA CAATTAAAAA TATACAAGCT GCTATTAAGA AAAGTAGTAA CATACTGTAG 2 9100 

CCATACATCA AGCCACCTAA CAATGCCCCA ATAGCTACCG ATAAGTTTGT CATCCAATAG 2 9160 

CTAATCTTGT AAATATAATG TTCCACGTCT TCGGTAATTG CATCCATAAT TAATGTGTCC 29220 

ATAACTGGAA ATTGTAATCC CCAAACGATT GTAAATATGG CATATGCAAC ACAAAAACCA 2 9280 

ATAATTTGCC AC AAT T GAT G TGACCCAAAT ACGCCCATGA ACACAAGCAT TATCACCATC 2 934 0 

GTCGCTTGAT AAATAAGTAC TAGCAACTTT tTCGGAAATA TCTBATAAG GTAACCAGAT 294 00 
ATAATGGACA ATGGAAATTT nAGAACCACT AAACCAACAA GATATATACC GACAATTGAT 29460

TGACTTAACA TATCTGTTAA ATATAGTGCT ATAAACGGTA TAAATGCTGT CGTAATAATT 2 9520 

AGCTGTAAAA nATTGCTAAT CAATCGTACT TTCAA 29555
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

AAAAAAAAAA AAAAAnGGTG AATCTTTAAT TAAACACTAA TATTGTAAAA GATGTTAAGT 60 

AAACGCTTAA TGACACTTAT TTTTTGAAAA TAATAGTAAT ATCATTTTGT TAAATGAAAG 120

AATAAAGCTA TAATmATTAT AGAATAACTA TTTAAAGGAG ATTATAAACA TGCCAATTAT 180

TACAGATGTT TACGCTCGCG mGTCTTAGA CTCTCGTGGT AACCCAACTG TTGAAGTAGA 240

AGTATTAACT GAAAGTGGCG CATTTGGTCG TGCATTAGTA CCATCAGGTG CTTCAACTGG 300

TGAACACGAA GCTGTTGAAT TACGTGATGG AGACAAATCA CGTTATTTAG GTAAAGGTGT 360

TACTAAAGCA GTTGAAAACG TTAATGAAAT CATCGCACCA GAAATTffiTG AAGGTGAATT 420

TTCAGTATTA GATCAAGTAT CTATTGATAA AATGATGATC GCATTAGACG GTACTCCAAA 480

CAAAGGTAAA TTAGGTGCAA ATGCTATTTT AGGTGTATCT ATCGCAGTAG CACGTGCAGC 540

AGCTGACTTA TTAGGTCAAC CACTTTACAA ATATTTAGGT GGATTTAATG GTAAGCAGTT 600 

ACCAGTACCA ATGATGAACA TCGTTAATGG TGGTTCTCAC TCAGATGCTC CAATTGCATT 660

CCAAGAATTC ATGATTTTAC CTGTAGGTGC TACAACGTTC AAAGAATCAT TACGTTGGGG 720

TACTGAAATT TTCCACAACT TAAAATCAAT TTTAAGCAAA CGTGGTTTAG AAACTGCAGT 780 

AGGTGACGAA GGTGGTTTCG CTCCTAAAIT TGAAGGTACT GAAGATGCTG TTGAAACAAT 840 

TATCCAAGCA ATCGAAGCAG CTGGTTACAA ACCAGGTGAA GAAGTATTCT TAGGATTTGA 900 

CTGTGCATCA TCAGAATTCT ATGAAAATGG TGTATATGAC TACAGTAAGT TCGAAGGCGA 960 

ACACGGTGCA AAACGTACAG CTGCAGAACA AGTTGACTAC TTAGAACAAT TAGSGACAA 1020 

ATATCCTATC ATTACAATTG AAGACGGTAT GGACGAAAAC GACTGGGATG GTTGGAAACA 1080 

ACTTACAGAA CGTATCGGTG ACCGTGTACA ATTAGTAGGT GACGATTTAT TCGTAACAAA 1140

CACTGAAATT TTAGCAAAAG GTATTGAAAA CGGAATTGGT AACTCAATCT TAATTAAAGT 1200

TAACCAAATC GGTACATTAA CTGAAACATT TGATGCAATC GAAATGGCTC AAAAAGCTGG 1260
TTACACAGCA GTAGTTTCTC ACCGTTCAGg aAACAGAAGA TACAACAATT GCTGATATTG 1320

CTGTTGCTAC AAACGCTGGT cAAATTAAAA CTGGTTCATT ATCACGTACT GACCgTATTG 1380

CTAAATACAA TCAATTATTA CGTATCGAgA TGAATATTT GAAACTGCTA AATATGACGG 1440

TATCAAATCA TTCTATAACT TAGATAAATA ATTTTCTnTA TAATCAAATG CTGACATAAT 1500

TTTAGTTGAG GATTATTATG ACGGTATAAA TAAATAAAG 1539



(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 846 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

CAATTTCTAT CTATCAATGA TGTGCATACT TCCAnTTAAA TTAAtCGAAA TGaATCAGC 60 

TATATCATTC CTGCCTCTTT ATATAACaAC AAATAGTGAT TACAATATTT CGGTTATTAA 120

CACGAAAATT TTACAAGCAC CTATTTCATT TACATATATA TACAGCAAAA AAGAAAGCCC 180

AGAAATATTG GTGTTTATTA AATCATTTAA AAAGTATATT GCCAATGAAC AATTATAATA 240

AATTTCAAAT CTAAAAAACC AAGAATGCGA TTAATCATCA CATTCTTGGT TCAATTTTAT 300

TCATGAATTT TTTCAACATT AAACGTTAAG TTATTGTCTG AATTTAAATT AACTTTAATC 360

GTTGTACCTT CTGGGAATCC CTCTTTAATC ATCATACGTG CTAATGGTGT TTCAATTTGG 420

CGTTGTACAA AACGTTTTAA TGGTCTTGCA CCGTATTGAG GTTCATAAGC TTCTTGACCT 480

AGCCAAGCTT TAGCATCATC AGAAACTTCA ATTGAGATTC GTTGTTCTAA TAATCTTATA 540

TTTAATTGCG TTAAGATTTT ATCTACAATC ATACTCATGT CATCAATAGA TAATGGTTTA 600

AATAATACGA TATCATCCAT ACGATTCAAA ATTTCTGGTT TGAAATATGC ATTTAAACTT 660

GTCATAACAG CTTTTTCTGT TGATTCTGTA ATTTCACCAG TCTCTTTTAC GTTTTCTAAT 720

AAAACTTGAG ATCCAATATT ACTTGTCATA ATAATAATAG TATTTTTAAA ATCAACGCTA 780

CGTCCTTTAG AATCAGTTAA ACGGCtTCAT CTAAAATTTG CAATAATACA TTAAAGACGT 840

CAGTAT 846
(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

nTGGGAACAG TAAGCCAGTA TTTTTAGAAA GTTGCCATAC ATGAGCGTCG ATTTTTCCAA 60 

TATGGCTATG ACACTAGAAC AATGGGAATT TGGAGGAAAA GTAAATGATT AAACCTAAAA 120 

TAGCATTAAC CATTGCAGGT ACTGATcCaA CAGGTGGTGC CGCGTAATG GCTGATTTAA 180 

AATCATTTCA TTCATGTGGT GTATATGGTA TGGGCGTCGT TACAAGTmTT GTTGCTCAAA 24 0 

ATACATTGGG CGTACAACAT ATTCATAATT T AAAT CATC A ATGGGTAGAT GAACAACTTG 300 

ATAGTGTCTT CAATGATACC TTACCTCATG CTATTAAAAC GGGGATGATT GCTACAGCAG 3 60 

ATACTATGGA AACGATTCGT CATTATTTAA TGCAACATGA ATCTATTCCA TATGTAATtG 4 20 

ATCCTGTTAT GTTGGCGAAA rCggTGATTC ATaATGGwTA ATGACaCAAg CaAAACTTGC 4 80 

AGCATaCGTT ATTGCCATTA GCTGACGTAG TAACACCGAA TTTACCAGAA GCTGAAGAAA 54 0 

TAACGGGACT AACCATTGAT AG T G\AG AAA AAAT TATGCA GGCTGGCCGC ATCTTTATTA 600 

ATGAGATTGG TAGTAAAGGT GTCATCATTA AAGGCGGTCA TTCAAATGAT ACTGATATAG 660 

CAAAAGATTA TTTATTTACT AACGAAGGTG TTCAAACATT TGAAAATGAA CGATTTAAAA 720 

CAAnACATAC GCATGGAACA GGGTGTACAT TTTCAGCAGT TATAACGGCAGAACTTGCAA 780 

AAGGTAGACC ATTATTTGAG GCTGTACACA AGGCTAAAAA GTTTATTTCA ATGAGTATAC 840 

AATATACGCC TGAAATCGGC CGTGGTAGAG GTCCAGTGAA TCATTTTGCA TATTTAAAGA 900

AAGAGGGATT AG AC GAT G AA TTATCTAAAT AAAATACGTA TTGAAAACCC ATTAACAATT 960 

TGTTATACAA ACGATGTAGT TAAAAATTTT ACAGCGAATG GTTTATTAAG TATTGGTGCT 1020 

AGCCCTGCAA TGAGTGAAGC TCCCGAAGAA GCTGAAGAAT TTTACAAAGT TGCACAAGCG 1080 

CTATTAATCA ATATCGGTAC TTTAACAGCA GAAAATGAAC AAG AT AT TAT TGCGATTGCT 114 0 

CAAACGGCAA ATGAGGCAGG CTTACCTATT GTATTTGACC CTGTAGCTGT TGGTGCTTCT 12 00 

AC AT AT C G AA AGCAATTTTG T AAAT T AT T A TTGAAATCAG CGAAAGTATC AGTAATTAAA 12 60 

GGCAATGCAT CTGAAATATT AGCGTTGATT GAT GAT AC AG CAACTATGAA AGGTACAGAT 1320 

AGTGATGCTA ATCTTGATGC GGTTGCAATA GCGAAAAAGG tTACGCAACA TATAAAGJTG 1380 

CAATAGTAAT CACAGGTAAA GAGGACGTTA TTGtTCmAGA TAATAAAGCC TTCGTATTAG 14 4 0 

CTAATGGATC TCCATTATTA GCACGAGTAA CTGGAGCTGG TTGTTTATTA GGAGGCGTTA 1500 

TTGCTGGATT TTTATTTAGA GAAACAGAAC CAGACATAGA AGCGTTAATT GAAGCGGTAA 1560 

GCgkATTTAA TATTGCTGCT GAGGTAGCTG CTGAAAATGA AAATTGTGGT GGTCCTGGTA 1620 

CGTTTTCACC ATTGTTGCTT GATACGTTAT ATCATTTAAA TGAAACAACC TATC 1674
(2) INFORMATION FOR SEQ ID NO: 210: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

ATGAGTTGCC GATGAATTTA GCACCACCAA CGATTGCnTT TGATACTGTG TCCCAACCAG 60 

CTTGTTTAGC ATATTTAATA CCTTCACGTA AAGC&.TCGTT ATCATATGCA GCAATACCAA 12 0 

ATACGTTATG GTATTTCGTG TTTGAGTTAG TTACAACTTT GTnTTGCACT ACATCTGCAC 18 0 

CTTTCGCTAA TTGAGAAGTA CCGTTACCTG TTTCTAATAG GGCATGTGAG ATAAGATAAA 24 0 

CTTCATTAAT GCCATACATT TGAGCAGCTT TGTTAAATGC AGCACCTTGG TTTTCTAATA 300 

CACCTTTACC TTTTAAGAAT TGATTAATTT TATCAATAGA AATATTTTGT GGTTGGTCTA 3 60 

AGCGTAAGAA TTGATATTTT AATGCTGGAT CTTGAGCTAA ACGCTTCGTA TCCATTGCAT 420 

GCTTAACATC ATTAAATTTA GCATCTGTCC ACTTACCTGG TACACGTTGT ACTTGTGGTT 4 80 

TATATTGTAA ACCAGCTTGT ATTTGAGCAA CTTGGTTTAA TGTCATACCT GTTTGATTAT 54 0 

ACTTAATTAA TTCTTTAGCT AAATCAGTTG ATTTAATCCA TGCtAATTTA CCGTTAGATA 600 

ATTTACCATA GTACCAAGTT TGTCCATTAA TGACTTGTTC TTTAACAACT GCGAATGGTT 660 

GTTCATTAAA TGCTTTTAAT GAGTATTTAG CTGTATCAGA SITTGGTGTT ACATAGTAAT 720 

AACCATTACC ATTTTTAATT ACATAAGTGT AGTTATAATC TTTGGcAGCT GATGTAGTTG 780 

GTTTCACAGC AGTTGGTGCA GTTAAATCTT TTGCATTTAC CCAACCAGTG CGGTTATTAA 840 

TAGTACCGTA TAAATAAACA TCTTTGCCTA CAGATACTTG TTTCGTTGCA TTAAATGTAC 900

CTTGAGCAAT GTTATTGCCT GTTAAAATGA CTTGGTTTTT AGTACCCCAA GGAACCATTG 960 

ATAAGCCGTT ATTTGATTTA TTAACAGTAT ATTTTTGAGT CGTTTTAACT TCTTTGCCTA 102 0 

AGTTTTGAAC ATTTAAGTCT TTTACATTGA ACCAACCTAA TGGGATGTTA TGGCTTGTAT 108 0 

TGTTTAATAA TACATACGTT TCATTACCAT GAGCACGCTC TTTTGTTACA TAGAACGTAC 114 0 

GGTCTGCATA TTTCGCACCG TTTTTCGCTG TTTTTTCATA AACAGAAGCA CGAATACCAG 1200 

TGTTGTTTGG TTTAACTTGA GCAATCTTGC TAACTGTTTG AGTCGTTTGT GGTTTAGTAA 1260 

CAGTATAAGC TTTTACAGCT GTTTTTGGTT GTGCTACTGC TTTTTTAGE GCAGCAGGTA 1320 

CAGCTAAATA TGCTTTACTT ACCCAACCAG ATTTACCATT TACAGTTCCA AATAAATAGA 138 0 

TAGATTTATC AATTTGTTGT TGCTTAGTCG CTTTAAAAGT TTGGTTACCT GTACCAGAAA 14 4 0 

CTGCACCAGC TTCTTGTTTA TAAGTGCCCC AAGGTACTGA ATATAATTTA GTGCCTGGgT 1500 

TTACTGTATA TGTTTGCATT AC AT T T AC AG GTGATTTTGC ATtGtTATAA ATACGTCACC 15 60 
TTGTTTAACC CAACCAATTA AAGTTGGACT ATTGTAATCT TTAACTAAGT AGAATTTGTT 1620

TCCACCTAAA CTTGCTTCTT TTGTTACAGC AAATGTTTTT TGAACTTCTT TCGTTGGCTT 168 0 

ACCAGTTTTG TCATAAACTG TAGTGAATAA GCCATTGTTT TTAGCATTAA TTTGAGCAAC 174 0 

ACCGTTTAAT GATGAAACTG TTAATTTATT ATTTGTTGTA GGTGTTGATG GCTTAGGTGT 1800 

TGGTGTAGGC GTAGGTTTAG CAGTATCAAC TAAATATGCT TTACTTACCC AACCAGATTT 1860 

ACCATTCACA GAGCCATATA AATAAATTGA TTTATCAATT TGTTGTTGCT TTGAAGCTT 1920 

AAATGTTTGG TTTCCAGAGC CAGACACACT ACCAGCAACT TGTTTAGATG TACCCCAAGG 1980 

TACTGTATAA AGTTTCGTAC CAGGTTTGAT TGAATATGAT T GAT T T AC AT TTACAGGTGA 20 4 0 

TTTAGCTGTG TTGTAAACCA CATCGCCTTC TTTAACCCAA CCAAATTTAT TACCAGAATT 2100 

GTAATCTTGA ACAAGATAGA ATTTTTGATT ACCTAATGTA GCTGTTTTAG ATACAGCAAA 2160 

TGTTTTTTGA ACTTCATTAG TTGCTTTACC AGTTTTGTCG TATACAGTAG TATATAAACC 2220 

ACTATTTGTT GG 2232 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GATTTAAATA AAATTAATGG ATATCGTGAT CGACGATGT TAGAACTTCT GTACGCAACG 60 

GGAATGCGTG TATCTGAATT GATACATTTA GAG T TAG AAA ACGTGAACTT AATAATGGGA 12 0 

TTTGTACGCG TATTTGGTAA AGGCGATAAA GAAAGAATTG TACCATTAGG CGACGCAGTC 180 

ATTGAGTACT TAACTACTTA TATTGAAACG ATTAGACCGC AACTTTTAAA AAAGACTGT 24 0 

ACTGAAGTCT TATTTTTAAA TATGCATGGT AAACCTTTAT CACGACAAGC AATATGGAAA 300 

ATGATTAAAC AAAATGGTGT AAAGGCAAAC ATTAAAAAGA CGTTAACGCC ACATACGTTA 3 60 

CGCCACTCTT TTGCGACACA TTTATTGGAA AATGGCGCAG ATTTAAGAGC AGTGCAAGAG 4 20 

ATGTtAGGtC ACTCTGaCmT ATCTACTACC CmaCTCTATA CmCATGTTTC GrAATCTCAA 4 80 

AT T AG AAAAA TGTATAACCA ATTTCATCCT AGAGCATAAA GTGAACAATA ACTCAAAAGT 54 0 

CACAATACAC ATGACTAAAA ATGTCTGTGC TATTGTGGCT TTTTTAAATT GGTTGATTAA 600 

TTACGTCTAT GTTTTCTTAA TTGAATCGCT TCTTCTTTTGCTGCAATCAC TTCTGAACGA 660 

TCACGGCGCA TGTGATGGTC TACAATAAAA GGATCTGTTG CTGTTTCCTG ATTATAATCA 720 
TAGTCTGGAT AGTTGGCCTT GATGATGCGT TCAAAGACTG GAGTTATTGG TAATATAACA 780

GATGAAAAAG GCTTTGCTGC ATTCAATTTT GCAATCTGTT GCTCAATTAA CAACTGATAA 840

T CAT T T AAAT TAAGGTATAA CGCATCTCTA TCTTTAGCAT TTTGTATTAT TTCTTTAGAT 90 0 

TTATTAAAAG ACTTATAGGC GCCTTTTAAA TTATTGCGGC GATAATGGTA AC AAGCAGT T 9 60 

GCAAACAAGA TTAAACTAAC AACTGCATCT TGCTTACTGT AGTTATTTTC AGCTTTCCAT 102 0 

GCATCTTCTA AAATGTCATG ACATAGGAAA TAATGTTGCT TAGTATGAAA TTGATAATAG 108 0 

AAATTTATCA GTGCCTGTTG CATTTTGTTA TCACCCCAAT TTAAAAGTAA GTTATTTTCA 114 0 

TGCTATAATA T T T T AG AG AA TTATGCACAT ATGACGCAAT ACGAGGTAGA TATTATGTAT 1200 

GAAGTTAAAT TAGATGCTTT CAATGGACCA TTAGATTTAT TGCTGCATT TATCCAAAAA 12 60 

TTTGAAATAG ATATTTATGA TATTCCTATG CAAGCATTAA C AG AGC AG T A TATGCAGTAC 132 0 

GTTCATGCAA TGAAACAGCT TGAAATTAAT ATTGCAAGTG AATACCTAGT ATTAGCGTCA 138 0 

GAACTCTTAA TGATTAAAAG TAAGATGCTA TTACCACAAT CAACATCAGA TATGGATGTT 14 4 0 

GAT GAT G AC C CACGGGAAGA TTTAGTtGGG CGTTTAATAG rATATCaAAA T T AT Ar AG AA 1500 

TATACTGCtA TTTTAAATGA CATGAAAGAA G AAAG AG AT T TTTATTTTAC C AAAAAG AC C 15 60 

GACAGATTTA TCtCATTTGG AAAc AG AT G A ATCyTGGGAT CCaAATCATA CGATTGATTT 162 0 

AACTGAATTA ATTGTAGCTT ATCAAAGAGT TAAAAATAGA GTTGAGTTAA ATACACCTAA 1680 

ATCTGTTGAA ATCCGAAAAG AGAC AT T T AC CAT T C AAC AA GCTACAGAAC AAGTGACATC 174 0 

GAGATTGAAA GAT AAAG AT C ATTTTAACTT CTTTAGTCTG TTTACGTTTT CTGAGCCAAT 1800 

TGAACAAGTA GTCACTCACT TTTTAGCTAT TTTAGAGATG TCAAAAGCAG GAATATTAA 18 60 

TATTGAGCAA CAACGTAATT TTGAAGATAT TAACATTATT AG AG G AG T G A ACTACCATTT 1920 

TGGATAATCA TGGTATATTA GAGTCGCTTT TAT TT AC AGC TGGCGATGAA GGTTTAGATG 198 0 

AAAAACAACT ATTAGAAATA TTAGATATGT C G AAAG AC C A ACTCGTTGAA TTAATTGAAA 204 0 

ATTATTCATC A3ATGGATTA ATGATACAAC GATTTGGAAT GA 2082
(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

TCTATTCTCG TTCTTCCAAG ACCCTGaATT AGAAGTTAAG AAAATCGAAG AAGATGAGAA 60 

AGAATCTATT AAAAAAGCTC AAAAAG G TAT T TAT AAAG AC CCTAGAGACA TCAATGATGA 120 
CGAACAAGAT GATGATACAA AAGATACTGT TtATAAAAAG GAATGATTGT AATTGCCTAA 180

CAAAAACACT CAAGAATATT GGGAAGAACG CGGACGCAAA GCAATCGAGA ATGAGTTGAA 24 0 

GCGTGATAAA ACTAAAGCTG AAGAAATAGA ACGTATATTG AATATGATGA TTAAGCGCAT 300 

T G AAAAAG AG ATCaATGCGT TTATTGTCAA GTACGGAGAT TTTGCAGGCG TTACATTSA 360 

AGAAGCACAA AAGATTATTG ATGAGTTCGA TGTAAAAGCG TTTCAAGAAG AAGC AAAAAG 420 

ATTGGTCGAA AACAAGGAGT TTAGCGATAG AGCAAATGAA GAATTAAAGA AGTATAACAC 4 80 

GAAAATGTAT GTATCTAGAG AACAGATGTT AAAGATTCAA ATAGAATTCT TAATTGCTTA 54 0 

TGCAACAGCT CAAACAGAAT TATCGATGAG GGAATATTTC GAATCAACAG CTTATCGTGT 600 

GTTCAGTGAT CAAGCGGGTA TTTTAGGTGA AGGTGTACAA GTAGCTAAAG AAGTTATAGA 660 

TACAATCGTT GATACACAAT TTCATGGTGT CGTTTGGTCA GAGCGATTAT GGACTAATAC 720 

CGAAGCAATG AAACAAGAAG TAGAAGAAAT AATTGCTAAT GTAGTTATTA GAGGTCGACA 780 

TCCTAATGAA TATGTTAAAG ATATGCGCAA cACTTAAATA AATTCGAAGG CACAGCACGA 840 

CAAAAGACCG CAGCAATTAA ATCATTGCTT TATACGGAAT CGGCACGTGT TCACGCACAA 900 

TCAAGCATTG ACAGCATGAA AGAAATTTCA CCGGAAGgAT ATTATATGTA TATTGCAAAA 960 

ATCGATAATA GAACAACTAA AGTATGCAAA GGGCTTAATG GAGAAATATT CAAAGTTAAA 102 0 

GACGC TAAAA TTGGTGTTAA TTTCTATCCT ATGCATATCA ATTGTCGTTC AGATTGCGCT 108 0 

TTACTACCTA AATCTATGTG GCCGAAAAAA CCAAGCAAGA AACGAAAAAC AAAATACTTC 114 0 

GGAGGGAAAG TGAAAAGCGG TGATTGATTT AAAAGTGAAG TTTTTTAAAG GCAAGTTAGT 12 00 

TTTGTATGAC AGTAAATTAA ATGTTTGGAG GATACTAATA TGAGTAATAC T G AC AAAT AC 12 60 

C T TAG AG AC A TAGCAAGAGA .ATTAAAAGGT ATACGTAAAG AGTTACAAAA GCGAAACGAA 1320 

ACAGTTATTA T T GAT GC AAA CTTAGACAGT TTAAGGTCGG CAGTATXGC CGATAAAGAA 138 0 

AAATCGAAAT ATAATGAACC TCTCTTTTAA TAGCTAGCAC TTAATTGTGT TGGCTATTTT 14 4 0 

TTATGTCCAA AACGTGCTGA TGACATAAAA AGCACGCATG G AAAAAC AG T CGACAGACTA 1500 

TAAATGGAGG TATATCTCAT GGAAGAAAAT AAACTTAAGT TTAATTTGCA aTTTTTTGCA 15 60 

GACCAATCAG ATGATCCGGA CGAACCAGGC GGAGATGGTA AAAAAGGAAA TCCTGATAAG 1620 

AAAGAAAATG ACGAAGGTAC TGAAATAACT TTCACGCCAG AGCAACAAAA GAAAGTTGAT 1680 

G AAAT AC T T G AACGTCGTGT AGCCCACGAA AAGAAAAAAG CTGATGAGTA TGCAAAAGAA 17 4 0 

AAAGCAGCAG AAGCTGCTAA AGAAGCTGCT AAAT TAG C G A AAATGAACAA GGATCAAAAA 18 00 

GATGAATATG AACGCGAACA AATGGAAAAA GAACTGGAAC AATTACGTTC AGAAAAACAA 18 60 

T T AAAC G AAA TGCGTTCAGA AGCACGAAAA ATGTTGAGTG AAGCGGaAGT T GAT T CATC A 192 0 
GATGrGGTTG TCAATTTAGT TGTAACAGAT ACTGCTGAAC AAACTAAATT GAATTTGAA 1980

GCTTTTTCTA ATGCAGTAAA AAAAGCGGTT AATGAAGCGG TTAAGGTTAA CGCTAGACAA 2 04 0 

TCGCCATTGA CTGGTGGAGA TTCATTTAAT CACTCGACTA AAAATAAACC GCAAAACTTA 2100 

GCTGAAATAG C T AG AC AAAA AaGAATTATT AAAAATTAAC GGAGGCATTT AAATGGAACA 2160 

AACACAAAAA TTAAAATTAA ATTTGCAACA TTTTGCAAGT AACAATGTTA AACCACAAGT 222 0 

ATTTAACCCT GACAATGTAA TGATGCATGA AAAGAAAGAT GGCACGTTGT TAAACGACTT 22 8 0 

TACAACACCT ATCTTACAAG AGGTTATGGA AAACTCTAAA ATCATGCAAT TAGGTAAGTA 2 34 0 

CGAACCAATG GAAGGTACTG AG AAGAAG T T TACTTTTGG GCTGATAAAC CAGGTGCTTA 24 00 

CTGGGTAGGT GAAGGTCAAA AAATCGAAAC GTCTAAGGCT ACTTGGGTTA ATGCTACAAT 2 4 60 

GAGAGCGTTT AAATTAGGGG TTATCTTACC AGT AAC AAAA GAATTCTTGA ATTACACTTA 2520 

TTCACAATTC TTTGAAGAAA TGAAACCTAT GATTGCTGAA GCTTTCTATA AAAAGTTTGA 2580 

CGAGGCAGGT ATTTTGAATC AAGGTAACAA TCCGTTCGGT AAATCAATTG CACAATCAAT 2 64 0 

TGAAAAAACT AATAAGGTTA TTAAAGGTGA CTTCACACAA GATAACATTA TTGATTTAGA 2 7 00 

GGCATTGCTT GAAGATGACG AATTAGAAGC AAATGCATTT ATCTCAAAAA CACAAAACAG 2 7 60 

AAGCTTGTTA CGTAAAATTG TAGATCCTGA AACGAAAGAA CGTATTTATG ACCGTAACAG 2 82 0 

TGATTCGTTA GACGGTCTAC CTGTGGTTAA CCTTAAATCA AGCAACTTAA AACGTGGTGA 28 80 

ATTAATCACT GGTGACTTCG ACAAATTGAT TTATGGTATC CCTCAATTAA TCGAATACAA 2 94 0 

AATCGATGAA ACTGCACAAT TATCTACAGT TAAAAACGAA GAEGCACAC CTGTAAACTT 3000 

GTTTGAACAA GACATGGTGG CATTACGTGC AACTATGCAT GTAGCATTGC ATATTGCTGA 3060 

TGATAAAGCG TTTGCTAAGT TAGTTCCTGC T G AC AAAAGA AC AG AT T C AG TTCCAGGAGA 3120 

AGTTTAATAA ATAATTAGGA GTGGTAACAT GCCCGAAATC ATTGGAATTG TTAAAGTAGA 318 0 

T T T T AC AG AT TTAGAAGATA AC AG AC AT G T CTATATGAAA GGGCATGTCT ACCCTCGTAA 32 4 0 

AGGTTATAAT CCTACAGATG AACGTATCAA AGCTTTAGCT AGTGTTGAAA ATAAACGCAA 3300 

CAAACAAATG ATTTACATTG TAAATGACAA ATTAACCAAA AAAGAACTTG TCGAAATAGC 3360 

AAGTGTTGCT GGCTTACAAG TTGA1SAAAA ACAAACAAAA GCTGAAATTA TCAATGCTTT 3420 

TGAGTCACTA GAGTAGGTGG TTATATGACT ACGCTAGCTG ATGTAAAAAA ACGTATTGGT 3480 

CTTAAAGATG AAAAGCAAGA TGAACAATTA GAAGAAATCA TAAAAAGTTG T G AAAGC C AG 354 0 

TTGTTATCAA TGTTACCTAT TGAAGTTGAA CAAATACCGG fAAAGgTTTAGTTACATGATT 3600 

AAAGAAGTTG CAGTTAAACG CTACAACAGG ATTGGTGCTG AAGtATGACA TCAGAAGCGG 36 60 

TTGACGGACG TAGCAATGCG TATGAATTGA AC GAT T t C AA GGAGTATGAA GCTATTATTG 3720 

ATAATTACTT TAATGCTAGA AC GAG AAC T A AAAAAGGAAG GGCTGTGTTC T T T T GAG AT A 37 8 0 
TGAAGATAGA GTTATTTTTC AATTAGAACA AGTAGCAACT TACAATCCTA AAACTAGCAA 3840

AAAAGAAAAC AC AC T AAT C A CTTATGATGC GATACCATGC AATATTAACC CCATTTCTAG 3 90 0 

AGCAAGAAAG CAACTTGAAT TTGGTGATGT AAAAAACGAT GTAAGTGTTC TGAGGATAAA 3 960 

AGAATCAATA TCTTACCCTG TTAGCCACGT GTTGGTTAAT GGCATTCGCT ACAAGATAGT 4 02 0 

TGATACAAGG ATATACAGAC ACGAAACGTC ATATTATATC GAAGAGGTCA ATT GAT GAAT 4 08 0 

ATAGATGGAT TAGACGCACT GTTAAACCAA TTTCACGATA TGAAAACCAA CATTGATGAT 414 0 

GATGTAGATG ATATTTTACA GGAAAACGCC AAAGAATATG TAG T AC GAGC TAAATTG&A 4 2 00 

GCTAGAGAAG TAATGAATA 4219
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

GCTTACAAGT ATATTCATAA TTACATATTC AAGGTCCTTG CATGTGGTAT TTTGCTATGG 60 

yCtTTaACTA CAACGGGGTC TAAGACTGCG TTTATCATAT TAATCGTCTT AGCCATTtAT 120 

TyCTTTATkA AAAAGTTATT TAGTAGAAAT GCGGTAAGTG TTGTGAGTAT GTCAGTGATT 180 

ATGCTGATAT TACTTTGTTT TACCTTTTAT AATATCAACT ACTATTTATT CCAATTAAGC 240 

GACCTTGATG CCTTACCGTC ATTAGATCGA ATGGCGTCTA TTTTTGAAGA GGGCTTTGCA 300 

TCATTAAATG ATAGTGGGTC TGAGCGAAGT GTTGATGGA TAAATGCCAT TTCAGTAATT 3 60 

AAATATACAC TAGGTTTTGG TGTCGGATTA GTGGATTATG TACATATTGG CTCGCAAATT 4 20 

AATGGTATTT TACTTGTTGC CCATAATACA TATTTGCAGA TCTTTGCGGA ATGGGGCATT 4 80 

TTATTCGGTG CATTATTTAT CATATTTATG CTTTATTTAC TGTTTGAATT ATT TAG AT T T 54 0 

AACATTTCTG GGAAAAATGT AACAGCAATT GTTGTAATGT TGACGATGCT GATTTACTTT 600 

TTAACAGTAT CATTTAATAA CTCAAGATAT GTCGCTTTTA TTTTAGGAAT TATCGTCTTT 660 

ATTGTTCAAT ATGAAAAGAT GGAAAGGGAT CGTAATGAAG AGTGATTCAC TAAAAGAAAA 720 

TATTATTTAT CAAGGGCTAT ACCAATTGAT TAGAACGATG ACACCACTGA TTACAATACC 7 80 

CATTATTTCA CGTGCATTTG GTCCCAGTGG TGTGGGTATT GTTTCATTTT C T T T C AAT AT 84 0 

CGTGCAATAC TTTTTGATGA TTGCAAGTGT TGGCGTTCAG TTATATTTTA ATAGAGTTAT 90 0 

CGCGAAGTCC GTTAACGACA AACGGCAATT GTCACAGCAG TTTGGGATA TCTTTGTCAG 9 60 
TAAATTATTT TTAGCGTTAA CAGTTTTTGC GATGTATATG GTCGTAATTA CTATATTTAT 1020

TGATGATTAC TATCTTATTT TCCTACTACA AGGAATCTAT ATTATAGGTG CAGCACTCGA 108 0 

TATTTCATGG TTTTATGCTG GAACTGAAAA GTTTAAAATT CCTAGCCTCA GTAATATTGT 114 0 

TGCGTCTGGT ATTGTATTAA GTGTAGTTGT TATTTTTGTC AAAGATCAAT CAGATTTATC 1200 

ATTGTATGTA TTTACTATTG CTATTGTGAC GGTATTAAAC CAATTACCTT TGTTTATCTA 12 60 

TTTAAAACGA TACATTAGCT TTGTTTCGGT TAATTGGATA CACGTCTGGC AATTGTTTCG 1320 

TTCGTCATTt AGCATACTTA TTACEAAATG GACAGCTCAA CTTATATACT AGTATTTCTT 138 0 

GCGTTGTTCT TGGTTTAGTA GGTACATACC AACAAGTTGG TATCTTTTCT AACGCATTTA 14 4 0 

ATATTTTAAC GGTCGCAATC ATAATGATTA ATACATTTGA TCTTGTAATG ATTCCGCGTA 1500 

TTACCAAAAT GTCTATCCAG CAATCACATA GTTTAACTAA AACGTTAGCTAATAATATGA 15 60 

ATATTCAATT GATATTAaCA ATACCTATGG TCTTTgGTTT AATTGCaATT ATGCCATCAT 1620 

TTTATTTATG GTTCtTTGGT GAGGAATTCG CATCAACTGT CCCATTGATG ACCATTTTAG 168 0 

CGATACTTGT ATTAATCATT CCTTTAAATA tGTTGaTAAg CaGGCAATAT TTAtTAAtAG 174 0 

TGAATAAAAT AAG AT TAT AT AATGCGTCAA TTACTATTGG TGCAGTGATA AACCTAGTAT 1800 

TATGTAyTAT TTTGATATAT TTTTATGGAA TTTACGGTGC TGCTATTGCG CGTTTAATTA 18 60 

CAGAGTTTTT CTTGCTCATT TGGCGATTTA TTGATATTAC TAAAATCAAT GTGAAGTTGA 1920 

AT AT T GT AAG TACGATTCAA TGTGTCATTG (TGCTGTTAT GATGTTTATT GTGCTTGGTG 1980 

TGGTCAATCA TTATTTGCC 1999



(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TCATTATTAA GAC TAT TATA TATAATGAAT TTTAACTGGT TTATTAAACG AGAACGTCGG 60 

GAATTAAGTA ACTACAATAA AAATAAGATA TGACAATAAG GAGACTACAC GGTGATCAT 12 0 

TGCCATAATT ATATTGATAT TTATTTCGTT TTTCTTTTCA GGAAGCGAGA CGGCATTAAC 180 

GGCTGCCAAT AAAACAAAAT TTAAAACTGA AG C T GAC AAA GGTGATAAAA AAGCAAAAGG 240 

CATTGTAAAG TTACTTGAAA AACCAAGTGA GTTTATTACA ACGATTCTAA TTGGGAATAA 300 

TGTCGCGAAT ATTTTATTAC CAACACTTGT TACAATTATG GCTTTACGTT GGGGGATTAG 3 60 

CGTTGGTATT GCATCAGCTG TTTTAACAGT TGTTATCATT TTGATCTCCG AAGTGATTCC 420 
CAAGTCTGTC GCTGCAACAT TTCCAGATAA AATAACAAGG CTTGTATATC CAATTATTAA 480

TATTTGTGTC ATTGTGTTCC GTCCTATCAC ATT&TTTTA AATAAGTTGA CGGACAGTAT 54 0 

TAATCGAAGT TTATCTAAGG GCCAACCTCA AGAAC AT C AA TTTTCAAAAG AAGAATTTAA 600 

AACAATGTTA GCAATTGCTG GACATGAAGG TGCTTTAAAT GAAATTGAGA CGAGTAGGTT 6 60 

GGAAGGTGTC ATTAATTTTG AAAATTTAAA AG T AAAAG AT GTAGATACAA CACCTAGAAT 720 

TAATGTGACG GCATTTGCTT CAAATGCGaC AT AC G AAGAA GTTTATGAAA CGGTTATGAA 780 

TAAGCCATAC ACTAGATATC CAGTGTACGA GGGAGATATT GATAACATTA TTGGGGTGTT 840 

TCATTCTAAA TATCTGTTGG C T T GG AG T AA TAAAAAAGAA AATCAAATTA CAAACTATTC 900 

AGCTAAGCCA TTATTTGTGA ATGAACACAA TAAAGCTGAA TGGGTATTAC GTAAGATGAC 9 60 

TATTTCTAGA AAACATTTAG CAATTGTGTT GGACGAATTT GGTGGTACTG AAGCGATAGT 1020 

G T C AC AT G AA GACTTAATTG AAGAATTATT AGGTATGGAA ATTGAAGATG AG AT G G AT AA 108 0 

AAAGGAAAAA GAAAAACTTT CTCAACAGCA AATTCAATTT &ACAACGGA AAAATCGCAA 114 0 

CGTATCTATA TAAGGAGCGA ACAGCTATGT GGAATAAGAA TCGACTTACT CAAATGTTAA 12 00 

GTATTGAATA TCCAATTATA CAAGCAGGTA TGGCAGGAAG TACGACACCG AAATTAGTTG 12 60 

CATCAGTAAG TAACAGTGGT GGGTTAGGCA CAATAGGCGC AGGTTACTTT AATACGCAGC 1320

AATTGGAAGA TGAAATAGAT TATGTACGCC AATTAACGTC AAATTCTTTT GGCGTAAATG 138 0 

TCTTTGTACC AAGTCAACAA TCATATACCA GTAGTCAAAT TGAAAATATG AATGCATGGT 14 4 0 

TAAAACCTTA TCGACGCGCA TTACATTTAG AAGAGCCGGT TGTAAAAATT ACCGAAGAAC 1500 

AACAATTTAA GTGTCATATT GATACGATAA TTAAAAAGCA AGTGCCTGTA TGTTGTTTTA 15 60 

CTTTTGGAAT TCCAAGCGAA C AGAT T AT AA GCAGGTTGAA AGCAGCGAAT GTCAAACTTA 162 0 

TAGGTACAGC AACAAGTGTT GATGAAGCTA TTGCGAATGA AAAAGCGGGT ATGGATGCTA 1680 

TCGTTGCTCA AGG TAG T G AA GCAGGTGGAC ATCGTGGTTC ATTTTTAAA CCTAAAAATC 1740 

AATTACCTAT GGTTGGAACA ATATCTTTAG TGCCACAAAT TGTAGATGTC GTTTCAATTC 1800 

CGGTCATTGC CGCTGGTGGA ATTATGGATG GTAGAGGAGT TTTGGCAAGT ATTGTCTTAG 18 60 

GTGCAGAAGG GGTACAAATG GGCACCGCAT TTTTAACATC AC AAG A C AG T AATGCATCAG 1920 

AACTACTGCG AGATGCAATT ATAAATAGTA AAGAAACAGA TACAGTCATT AC AAAAG C G T 1980 

TTAGTGGAAA GCTTGCACGC GGTATCAACA ATAGGTTTAT CGAAGAAATG TCCCAATACG 204 0 

AAGGCGATAT CCCAGATTAT C C AAT AC AAA ATGAGCTAAC AAGTAGCATA AGAAAAGCCG 2100 

CAGCAAACAT CGGCGACAAA GAGTTAATAC ATATGTGGAG TGGACAAAGC CCGCGACTAG 2160 

CAACAACGCA TCCCGCCAAC ACCATCATGT CC AAT AT AAT CAATCAAATT AATCAAATCA 2220 
TGCAATATAA ATAATCGACC GCAATCCACA AAAGCACAAG CACCCCCAAA CATTATTTTA 2280

GTGCTTGCCA TTTTTGTGGA TTGCGTTTCT ATTTTACCAA TTTAATCAAA CGAAA&A.TC 2340 

AAGCTGAAGA TCGCCGAAAG ATTTTAATCA AGCAAAAACA TCAAACTAAA GTTCGCTGAA 2 4 00 

AT GAT TAT GA T AAAAG f TAT ATGGTATGAT GACATTGGTG AT AT AT AT G A TAAACATCGG 24 60 

ATTAACAGGT TGGGGTGATC ACTATTCATT ATATGAAGAT TTAGAACGCC AAACCGATAA 252 0 

ACTTAAAACA TATGCTGGAC ATTTTCCGGT TGTCGAATTA GATGCGACAT ACTATGCGAT 2 58 0 

ACAACCGGAA AGAAATATAT TGAAATGGAT AAAAGAAACG CCTGATACAT TTGAATTTGT 2 64 0 

GGTCAAAATT CATCAAGCAC TcACATTGCA TGCAGACTAC AAAACATTTG CAGATACAAG 27 00 

GCAAGAACTA TTTGATCAAT TTAAGAATAT GTTAGAGCC T T AC AT AC AC AGAAAAAATT 27 60 

AGCAATGGTA TTGGTTCAAT TTCCGCCATG GTTTGACTGC AATGCACAAA ATATCAAATA 2820 

TATTTTGTAT GTAAGACAGC AATTACAAGC ATTTCCAATG TGTGTAGAAT TTAGGCATCA 2880 

ATCATGGTTT AGTGATGCAT TTAAAGAACA AACATTGGCA TTTTTAACAG AACATCAAAT 2 94 0 

CATTCATGCA GTAGTTGATG AACCACAAGT GAAAGATGGC AGTGTACCTT TAGTCAATCG 3000 

AATCACAAAT GAAATTGCGT TTGTACGTTA TCATGGACGT AATCATTACG GTTGGACTAA 30 60 

GAAAGATATG TCAGATCAAG AATGGCGCGA TGTACGCTAT TTATATGATT ATAATGAGCA 3120 

AGAATTAATA GACTTGGCAC AAAAG G C AC A AATAT T AGCA CAAAAAGCTA AGAAAGTTTA 3180 

CGTCATATTT AACAATAATT CTGGTGGTCA TGCAGCAAAT AATGCCAAAA CATATCAGCG 32 4 0 

ATTATTGAAT ATAGAATATG AAGGGTTAGC ACCACAACAA TTAAAATTAT TTTAAGAGGC 3300 

GACGACTATG TTATTAACAA TTACATTATT AGTTTTAATC GGAGCBTTGT CAGCGATTAT 3360 

AGGGTCTATC GTAGGCATTG GAGGCGGTAT TATTATCGTT CCAACAATGG TTTACCTCGG 34 20 

TGTTGAACAT GGATTACTAC ATAATATTAC AACACAAGTA GCGATAGGGA CGTCTTCAGT 34 8 0 

CATTCTAATT GT G AC AG G AC TTTCTTCATC ACTTGGATAT TTAAAAACAA AACAAGTTGA 354 0 

TAT T AAAAAT GGTTCCATCT TTTTATTTGG ACTATTACCA GGTTCATTGC TTGGGTCCTT 3600 

CATTAGTAGA TATTTAACAT TTGAGTCATT TAATTTATAT TTTGGTATCT TTTTAATTTT 3 6 60 

CGTAGCCATT TTATTAATGG TAAGAAATAA GATTAAACCG TTTAAAATTT TCGATAAACC 3720 

CAAGTATGAA AAGACTTATG TAGACGCTAA AGGTAAAACA TATCATTATA gTGTTCCACC 37 8 0 

ATTGTTTGCT TTTATTACAA CGTTTTTAAT TGGTATATTG ACAGGTTTAT TTGGTATTGG 38 4 0 

AGGTGGCGCA CTAATGACGC CACTAATGCT TATTGTATTT AGATTTCCAC CTCATGTAGC 3900 

TGTTGGAACA AG TAT GAT GA TGATTTTCTT TTCAAGTGTC ATGAGTTCTA TSGGCACAT 3960 

TGCTCAAGGT CACGTAGCTT GGGGTTATGC AATCATnTTA ATTATTTCTA GTTATTTTGG 4020 

TGCGAAAATC GGTGTCAAAG TGAATCAATC AATTAAGTCA GATACGGTAG TAACATTATT 4 080 
GAGAACAGTA ATGTTGTTAA TGGGTATATA TTTAATTATT CGTGCGTTGA TTTAATACAA 4140

CTTTAAAAGG AGGACGTCAA TTTGAGGCTT ACAATTTATC AT AC G AAC G A TATTCATAGT 4 2 00 

CATTTACATG AATACGAACG CAT T AAAGC A TATATGGCAG AACATCGGCC ACGACTTAAT 4 2 60 

CATCCTTCTT TATATGTTGA TCTAGGTGAT CAT G TAG AT T TATCCGCACC TATAACTGAA 4 320 

GCAACTTTAG GTAAAAAGAA TGTGGCATTA CTA26.TGAAG CAAAATGTGA TGTTGCAACA 4 38 0 

ATCGGTAATA ATGAAGGGAT GACCATTTCA TACGAAGCTT TAAATCACCT TTACGACGAA 4 440 

GCAAAATTTA TAGTGACATG TAGCAATGTT ATAGATGAAT CAGGTCATTT ACCAAATAAT 4 500 

ATCGTTTCTT CTTATATTAA GGACATAGAC GGTGTGAAAA TACTATTCGT TGCAGCGACA 4 5 60 

GCACCTTTTA CCCCATTTTA TCGTGCACTA AATTGGATTG TTACCGATCC ACTTGAATCT 4 62 0 

ATAAAAGAAG AAATTGAACT TCAACGAGGT AAATTTGATG TATTAATCGT GCTAAGTCAT 4 680 

TGTGGCATTT TCTTCGATGA AACATTATGC CAAGAATTGC CTGAAATTGA TGTCATTTTT 4 74 0 

GGTAGTCATA CGCATCATTA TTTTGAACAT GGTGAAATCA ATAATGGTGT ACTGATGGCG 4 8 00 

GCAgCTGGAA AGTATGGTAA TTATCTTGGA GAGGTTAATT TAACTTTTGA GGCACATAAA 4 8 60 

GTAGTACATA AAACTGCAAA GATTATTCCT T T AG AAAC AT TACCTGAAGT TGAAACTTCA 4 92 0 

TTTGAAGAAG AAGGAAAAAC GTTAATGTCC AATTCAGTAA TCAACATCC AGTAGTGCTT 4 980 

AAGCGTAGTA TGAATCACAT AACTGAAGCT GCATACTTAT TAGCTCAAAG TGTTTGTGAG 50 4 0 

TATACACATG CACAATGTGC CATCATCAAT GCTGGCTTAC TCGTTAAAGA TATTGTAAAA 5100 

GATGAAGTGA CAGAATATGA CATTCATCAA ATGTTACCGC ATCCGATTAA TATGGTAAGG 5160

GTTAGACTTT TTGGTGTGAA AT T AAAAGAG ATTATAGCTA AAAGTAATAA ACAAGAATAT 5220 

ATGTATGAAC ATGCACAAGG TTTGGGTTTC AGAGGGAATA TATTTGGAGG ATATATTCTT 5280 

TATAATTTAG GGTACATTCA TTCTACAGGG CGTTACTATC TGAATGGAGA AGAAATCGAA 534 0 

GACGACAAAG AATATGTACT AGGTACGATA GATATGTATA CGTTCGGTCG TTATTTCCCA 54 00 

AC AT T G AAAG AATTACCAAA AGAGTATTTA ATGCCAGAGT TTTTAAGAGA T AT ATT T AAA 54 60 

G AAAAAT TAT TGGAATATTA AAAAGTAAGA TTATTGGATT TTCATTTGTC ATGAATTTCG 5520 

ATATAATGTT TAAAGATACA CTTAACAGGA GGGTATGTGT TGTTATGGG ACAAAAAACG 558 0 

AGGAAATATT ACGTAAACCG GATTGGTTGA AAATAAAATT AAATACCAAC G AAAAC TATA 5 64 0 

CAGGACTTAA GAAGATGATG AGGGAAAAAA ATCTTAATAC TGTATGTGAA GAAGCTAAAT 5700 

GTCCTAATAT ACATGAATGT TGGGGTGCAC GTCGTACAGC GACATTTATG ATTTTAGGTG 57 60 

CCGTATGTAC AAGAGCTTGT CGTTTTTGTG CGGTTAAGAC AGGTTTACCT AATGAACTTG 5820 

ATTTAAATGA GCCTGAACGT GTAGCTGAAT CAGTTGAATT AATGAATTTG AAACACGTTG 58 8 0 
TTATCACTGC TGTTGCGCGT GATGATTTAA GAGATGCTGG TTCAAATGTT TATGCTGAGA 5940

CAGTACGTAA AGTTAGAGAA AGAAATCCAT TTACAACGAT TGAAATTTTA CCATCAGATA 6000 

TGGGCGGGGA CTATGATGCG TTAGAAACAT TAATGGCGTC AAGACCTGAC ATTTTAAACC 60 60 

ATAATATTGA AACTGTTCGT CGCTTAACAC CGAGAGTTCG TGCGCGTGCG ACTTACGACA 612 0 

G AAC AT TAG A GTTTTTACGT CGTTCAAAAG AATTACAACC GGATATCCCA ACTAAACAA 618 0 

GTATTATGGT TGGATTAGGT GAAACTATAG AAGAAATTTA TGAAACGATG GAT GAT T T AC 62 4 0 

GTGCGAATGA TGTAGATATT TTAACGATTG GTCAATATTT ACAACCTTCA CGTAAACATT 6300 

TAAAGGTTCA AAAATATTAC ACGCCTTTAG AGTTTGGTAA ATTAAGAAAA GTGGCAATGG 6360 

ATAAAGGGTT TAAACATTGC CAAGCTGGAC CTTTAGTACG TAGTTCTTAT CATGCGGATG 6420

AGCAAGTAAA TGAAGCTGCT AAAGAAAAGC AACGCCAAGG TGAGGCACAG TTAAATAGTT 64 8 0 

AATATTTAAC CATTAATAAG GCATAAAGGC TTAGTTTGTA CAAAACGAAC GTGTCATAGA 654 0 

AGTAATCATT ATTTTTATGA AACACTAAAA GTATAAACTG AGCCTTTCAT TGTCATAGAT 6600 

AGGTGAAGAA TTTGATAAAA GTAGATCAAC ATTACTTTGA ATTAATAGAA AATTATCGCG 6660 

AATGTTTTAA TGAAGAACAA TTTATTGCTA GGTATTCAGA TATTTTAGAT AAATATGATT 6720 

ACATAGTTGG TGACTATGGT TACGATCAAT T ACGAT T AAA AGGTTTTTAC AAAGATTCTA 6780 

ATAAAAAAGC AGAGAT GAG T AAACGTTTTT CAAATATTCA AGATTACATA TTTGAATATT 68 4 0 

GTAACTTTGG TTGTCCTTAC TTTGTATTAA GACATTTGTC TAAACAAGAG GTTAAAAAGT 6900 

TAATCGAAGA AGTTCATCCG TCTGATGTGA TAGATGACGA CAATAAACTT CAAGATGTGA 69 60 

AGATTAAGCC AACCATTCAA GATACTGAAC ATTAATAAAA CCCTTAGCTA GATTGAAAAT 7 020 

GGGAATCATG CAATTCAAGC ATGGACCTGT AATCTAGTTA GGGGTTTTTA TCTTTAATGA 7080 

AT GAC T T CAT TTAAATACTC AGTAATTTCA TCGCCTTCTT CAGCATTTAC ACCTAAAATA 714 0 

TGAGCGATAT AGCCTTCTTC TTTTAAATCA TCAGTACCGA TAATECGAA TTTATTTGTT 7200 

TGCATATTAA GTACGAGTGT CTTACCATAA TGTCTATTTG TATGGACTAA CAT C AAAT C A 7 2 60 

TATCGACTAT GCTCGCCAAC AAAACCAACA AACTGAACTT GACTCTCTTC GTTGTCATCA 7320 

TATAAATACA TATCAATCAT TTTGTAGCGA CTCCTTTTAA AAGTAGTAAA GTTAGTATAA 7 380 

CGACAAATGA AGTATACTGC AAAAT TAT G A TAATATATAA GTGAGAGGTG ACAAGGAATG 74 4 0 

TATTTTGTAG ACAAAGATAA ACTAACTCAG AAATTAGCCT ATTTACAAGC ATTAACTGAT 7 500 

GATTATCATG AGAGCAAGCA CAATCATTAT GCATTTGAAC GCATTGCTCA AATGTTGATA 7 5 60 

GAATCATCGG TAGATATAGG GAATATGVTT ATCGATGCAT TTATTTTAAG GGATCCTGGT 7 620 

AATTATAAAG ATGTGATTGA TATATTAGAA CTAGAAAATG T TAT T AC T AA AGAAACACAG 7 68 0 

CAGGCGATTA ATAAAACTGT CGGTATTCGT AAACAATTTA CATATGATTA CACAGCCTTA 77 4 0 
GATGTTGAGA TTATCATGCC AATGTTTGA 7769
(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

ACCGCCACCC ATTAATGATT GCTTAAAATC AATAGTCGTA CCATTTAATA CGGGTGCATC 60 

TTTTTTGTCT ACTAATACTT TTAATCCAAA GTATTCTAAG ACTTCATCAT TTTCACCAGG 120 

CGCTTCTTCT GCACCCATAC CGTATGTTAA ACCAGTGCAC CCGCCACCAT TCACTTTAAT 18 0 

TTTTAAATAG CCATCTGGCA TTTCATTTGC TTTAAGCATA TCTTTTACTT CGTAAGCAGC 24 0 

TGCTTCTGTT AATATAACTG TTGGCATGAT AACTCCTCCT TAAAAAATCC AAGTTTCTTT 300 

TATATGTGCA TAT AT AT T T T GTAATAATTC TTCCGGCGAA TCACCTTCAA CAATATCACC 3 60 

ATTTACTAAA GCATACAACC CGGCTGAACA TATACCACAA TGTGTCAGGC AACCATACTC 420 

TAACACATCG ACATCTGGGT CATTTTCCAG TTGATTAAAA ACATAATCTC CACCTTTTGC 4 80 

CATGTTAGAG AGACAAAATT CTACGATCGG ATTCATACTT CACCTTCTTA TTTCATTTGT 54 0 

TACAATATTA TAGCATTTTA AAACTGGTAT TTTAACATGA TGTGCTCAAT TAGCAEAAC 600 

TGATGTTTCT TATCCCAGTT ATGTAATAGT GCCTTAGTTA GTAC 644
(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

GAATGATGAA AGGAATAGAA AAGAAAAGAT AAATAATGTA ATAGATTTAT CCGAGAAAAT 60 

TGAAAGAACA AAAGATATGC CAATCAAGAA TACTATAACT ACTCAATTAG GAAATAAACT 12 0 

TATTGGCACA AAAAAAGCTC GTTTTGATGA T AAGAAAG T A GTGTCGTTTG GAGCATTTGA 180 

AGATGAATAA AATAAATGAT AG AG AT T T AA CAGAATTGAG TAGCTATAGG GTTTATCAAG 24 0 

ACATCAATAA AGATAATGAC TTTACAGTTA AC G AAAAAC G ATTTAAGCAG GCAGATGTAT 300 

TTGAAGATTT ATATAGAGAG AAACTAAAAG ACKAAATAA ATTAAGAGAG TATAATTATT 360 
TACAAAATGA AACTTTTAAA AGCGCATAAA TAGGTGATGA GATATGCTTA AAAAAGCAAA 420

ATTTATCTTA ATGGCAACGA TACT ACT ATC AGGATGTTCA ACTACCAATA ACGAATCCAA 4 80 

CAAAGAAACA AAATCTGTAC CAGAAGAAAT GGATGCTTCA AAAT AT G TAG GACAAGGAT 54 0 

CCAACCACCT G C AG AAAAAG ATGCGATTGA ATTTGCAAAG AAGCATAAAG ATAAAATTGC 600 

TAAGCGAGGC GAACAATTTT TTATGGATAA CTTCGGTCTA AAAGTTAAAG CTACAAATGT 6 60 

TATAGGTAGT GGCGATGGTG TAGAAGTATT CGTGCATTGT GATGACCACG AyATCGTATT 7 20 

TAATGCGAGT ATTCCATTTG ATAAATCAAT wATTGAsAGT GATAGCTCAT TAAGAAGTrA 780 

GGAyAAAGGy GAT G AT AT G A GTACTTTAGT TGGTGCAGTA CTCAGTGGGT TTGAATATCG 84 0 

AGCACAAAAA GAAAAAT AT G ATAAATTATA TAAATTTTTC AAAGATAATG AAGAGAAATA 900 

T C AAT AT AC A GGATTTACAA AAGAAGCAAT TAATAAGACGCAAAATAGTG GTTATGAAAA 9 60 

TGAATATTTT TATATTTCGG CCATACCTTA TAATTTAGCT GAGTATAGAG ACTATTTTGA 1020 

ACCATTGTTA AACAAAAGTG ACAGTGAATT TTCAAAAGAA TTGTCAAATG TTAAGAAGCA 1080 

ATTAAAAGAT AAGTCTAAAG TTTCGGTAAC TACTACTCTA TTTAGTAAAA AAAAGAACTA 1140

TACTAAAAAA AGTAACAGTG AAAATGTAAT AAAAATGGCA GAAGAAATAA AAAAAG AT AA 12 00 

AGAGATACCA AACGGTATAG AGCTTAGTAT AAAATTTTCG GACAATAAAA TAAATACGGT 12 60 

TAAACCAAAT TTTAACGGTG aAAGCACTTC AGAATATGGT GTGTTTGATC AAGAATAAAA 1320 

TTAATGATGa AAATTTAACG GAJAATAGTG TATATTGAGT AG AT CmAG AA TAAAAAGATA 138 0 

ATTCTACTAT TGTTGTGAAG GCAAATAAGT AGAAGATTTT AAGTGTAATT TCTGGTGATT 14 4 0 

TAAATAATAA TATAnATGGn AGTACTGATA TAAnACTTTT TAACCTACTA GATTCTTATA 1500 

ATTTGCTTTC CATTTTATGA CGATTTTTAC TCCAATTGAG TGATAGAXC CAAAAAAGCC 1560 

ATCTCCAAAA ATTAATCC 1578



(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

TGTTTTCCTT GGGTTAAAAC ATGCTTGCTA TGCGTTTGTA AATATGACTT GCTGTTTTnA 60 

CCTGnATACC CGTCACACCA TGGAAGTAAA AATGTTTCTT GCTCTTGGCT TACAATTTTA 120 

GCTTTAATCG CTTCATATGC TTTATATTGG TCTTCTGTTA ATTGCTGTTT TGATTCTTGT 18 0 

TCGAAAACAC GATCTTTAAA TGGGTCTCTT TCAACAACCG CGTCATATTT T TCAACAT AA 24 0 
CCTTTTTTGA TAAGTCCATC TAAACTGGAT TTTGAAAAGC CCATATCCTC AATATCAGTT 300

AAAAATATTG TTTTATGTTG TTCTXAGAC AAGTAAGCAT ACAAATCGTA TTGTTTAATA 3 60 

ACTTTCTCCA ACTTAGCTAA TACTTCATCA GGATGATACC CTTCAATGAC ACGAACAGCA 4 20 

CGCTTGGTTT TTTTAGTTAT ATTTTGTGTG AGAATCGTTT TTTCTTCAAC GATATCATCT 4 80 

TTTAACAACT TCATAAGCAA TTGAATATCA TTATTTTTTT GCGCATCTTTATAATAATAG 54 0 

TAACCATGCT TATCAAATTT TTGTAATAAA GCTGAAGGTA GCTCTATGTC ATCTTTCATC 600 

TTAAATGCTT TTTTATACTT CGCTTTAATA GCACTCGGAA GCATCACTTC TAGCATAGAA 6 60 

ATACGTTTAA TGACATGAGT TGAACCCATC C AC T C AC T T A AAGCTATTAA TTCTGATGTT 7 20 

AATTCTGGTT GTATATCTTT CACTTCTATG ATTTTTTTTA ACTTCGAAAC GTCAAGTTGT 7 80 

GCATCAGGTT CTGCTGTTAC TTCCATTACA TAACCTTGAA TCGTTCTTGG TCCAAAAGGT 840 

AC AAT T AC AC GCACACCAGG TTGGATGACA GATTCGAGTT GTTCGGGAAT TATATAATCA 900 

AATTTATAGT CAACGCTCTT CGACGCGACA TGACTATGA CTTTCGCTAT CATTATTGCC 9 60 

ACCTAGTTTC TAGTTCATCT AAAATTTGTG CAGCTAATAC TACTTTTTTT CCTTTCTTGA 1020 

TATTTACTTT TTCATTATTT TTAAAATGCA TTGTCAATTC ATTATCATCA GAACTAAATC 1080 

C GAT AG AC AT ATCCCCAACA TTATTTGAAA TAATCACATC TGCATTTTTC TTGCGTAAT 114 0 

TTTGTTGTGC ATAATTTTCA ATATCTTCAG TCTCTGCTGC AAAGCCTATT AAATACTGTG 12 00 

ATGTTTTATG TTCACCTAAA TATTTAAGAA TGTCTTTAGT ACGTTTAAAA GATACTGACA 12 60 

AATCACCATC CTGCTTTTTC ATCTTATGTT CTAATACATC AACCGGTGTA TAGTCAGATA 1320 

CGGCTGCTGC TTTTACAACA ATATCTTGTT CGTCAAATCG GCTTGTCACT TGTTCAAACA 1380 

TTTCTTCAGC ACTTTGAACA TGAATAACTT CAATATCTTT TGGATCCTCT AGTGTTGTAG 14 4 0 

GACCAGCAAC TAACGTCACG ATAGCTCCTC GATTTCGCAA TGCTTCAGCT ATTGCATAGC 15 00 

CCATTTTTCC AGAAGAACGA TTGGATACAA ATCTGACTGG ATCGATAACT TCAATAGTTG 15 60 

GTCCTGCTGT AACCAATGCG CGTTTATCTT GAAATGAACT ATTAGCTAAA CGATTACTAT 1620 

TTTGAAAATG AGCATCAATT AC AG AAAC G A TTTGAAGCGG TTCTTCCATA CGTCCTTTAG 1680 

CAACATAACC ACATGCTAGA AATCCGCTTC CTGGTTCGAT AAAATGATAC CCATCTTCTT 1740

TTAAAATATT AATATTTTGC TGCGTACGTT TATTTTCATA CATATGCACA TTCATAGCAG 18 00 

GCGCAATAAA TTTCGGTGTC TCTGTTGCTA GCAACGTTGA TGTCACCAAA TCATCAGCAA 18 60 

TACCTACACT CAATTTTGCA ATTGTATTTG CCGTTGCAGG TGCAACAATG ATTGCATCTG 192 0 

CCCAATCACC T AAT GCAAT A TGCTGTATTT CTGAAGGATT TTCTTCTATA AAAGTATCTG 198 0 

TAT AAAC AGC ATTTCGACTT ATTGCTTGAA ATGCTAATGG TGTCACAAAT TTTTGTGCGT 204 0 
GATTCGTTAA CATAACGCGA ACTTCATACC CAGATTGTGT TAACTTACTT GTCAAATCAA 2100

TTGCTTTATA TGCCGCAATG CCACCTGTAA CGGCTAATAA TATTTTCTC ATATTCAATC 2160 

TCCCTTAAAT ATCAC TATGA CATTTACGCT TTACATCATC ATATGCGCAC AAATGCTCAT 2220 

TACTTTTTTA TAGATACAAA TTTAGTATTA TTATAACATC AATCATTGGA TAAACTAAAA 2280 

AAACACACCT ACATAGGTGC GTTTGATTTG GATATGCCTT GACGTATTTG ATGTACGTCT 234 0 

AGCTTCACAT ATTTTTAATG GTCGAAACTA TTCTTTACCA TAATAATCAC TTGAAATAAC 2 4 00 

AGGGCGAATT TTACCGTCAG CAATTTCTTC TAACGCTCTA CCAACTGGTT TAAATGAATG 2 4 60 

ATATTCACTT AATAATTCAG TTTCAGGTTG TTCATCAATT TCACGCGCTC TTTTCGCTGC 2 52 0 

AGTTGTTGCA ATTAAATACT TTGATTTAiT TTGTGaCGTT aATTGGTTtA AAgGTGGATT 2580 

TAACATTATT TTTTAGCCTC C AAAAT CAT T TTTCTATACT TAGCTTCTAC GCGCTCTCTT 2 64 0 

TTTArGTGcT CAGCTTCTAC AATACATTGA ATTCtATTCy TCGcAAGtTC TACTTCAtCA 2700 

TTAACTACAA cGTAAyCGTA T AAAT T CAT C ATTTCrACTT CTkTACGCGC yTCETAATA 2 7 60 

CGACTTTGTA TTTTCTCATC AGATTCTGTT CCTCTACCTA CTAATCGCTC TCTCAAGTGT 2820 

TCTAAACTTG GAGGTGCTAA G AAAAT AAAT AGCGCATCTG GAAATTTCTT TCTAACTTGC 2880 

TTTGCACCTT CTACTTCAAT TTCTAAAAAT AC AT CAT G AC CTtCGTCCAT TGTATCTTTA 2 94 0 

AC AT AT T GAA CTGGTGTACC ATAATAGTTG CCTACATATT CAGCATATTC TATAAATTGG 3000 

TCATCTTTGA TTAAAGCTTC AAACGCATCC CTAGTTTTAA AAAAGTAATC TACGCCATCA 3060 

ACTTCACCTT CACGCATTTG ACGTGTTGTC ATTGAAATAG AAT ACT TATA TGATGTACTT 3120 

GGATCTTCAA ATATnCGTnT TCTAACAGTA CCTTTAXTA CTCCAGATGG TCCTGATAAA 318 0 

ACGATTAACA ATCCTTTTTC ATTATCCATG CCTTACGACC TCTCTAAGCT AATCTTCTAT 324 0 

TATTTAAATA TGATATCACA TTGTTCTTTA TATTGTATAG CATATTTGAA ATTGCATGCC 3300 

ATAATTTCTA TTAAGTCTAA CAATATCGTT ATATTGCACG ATTAATTTTA AT T AAAT AAA 33 60 

TTGAATTGCA AACTTTTAGA TAATGTAAAA TGTATGGCAT AATGTATGGT TCAATAACTA 34 2 0 

TACT G AAAAG TTACAATCAT GTTAAAATGA AACGAATGAT ATGAAGAAGG TGGAAGATAA 34 8 0 

ATTATGGCTT ATGATGGCTT ATTTACAAAG AAAATGGTTG AGTCTCTACA ATTTTTAACA 354 0 

ACAGGACGTG TTCACAAAAT CAATCAACCT GATAATGACA C GAT AC T AAT GGTTGTACGT 3 600 

CAAAATAGAC AAAAC CAT C A ATTGTTATTG TCAATCCATC CAAACTTTTC AAGATTACAA 3 660 

T T G AC T AC T A AAAAATATGA TAATCCATTT AATCCACCCA TGTTTGCGCG TGTTTTTAGA 37 2 0 

AAACACTTAG AAGGTGGTAT TATCGAATCG AT TAAGCAAA TTGTAATGA TCGTCGCATT 37 8 0 

GAAATCGATA TAAAGAGTAA AGATGAAATT GGCGATACTA TTTACCGCAC TGTCATCCTT 384 0 

GAGATTATGG GTAAACATAG TAACTTAATT TTAGTAGATG AAAATCGCAA AATAATTGAA 3900 
GGATTTAAAC ACTTAACACC AAATACGAAT CACTATCGTA CAGTAATGCC AGGATTTAAT 3960

TATGAAGCAC CACCTACTCA GCACAAAATA AATCCGTATG ATATTACAGG TGCAGAGGTG 4 02 0 

TTGAAATATA TCGATTTTAA CGCAGGTAAT ATTGCTAAAC AATTATTGAA TCAGTTTGAA 4 080 

GGATTTAGCC CTTTAATTAC GAATGAAATC GTTAGTCGTC GTCAATTTAT GACTTCATCA 4140 

ACATTACCAG AAGCATTTGA CGAAGTAATG GCAGAAACCA AGTTACCACC TACTCCTATT 4200 

TTTCATAAAA ATCATGAAAC AGGTAAAGAG GATTTCTATT TTATAAAGTT AAATCAATTT 4 2 60 

AATGATGATA CAGTTACATA CGATTCATTA AATGATTTGC TTGATCGTTT TTATGATGCG 4 320 

CGTGGCGAAC GTGAACGCGT TAAACAACGT GCGAATGATT TAGTTCGATTTGTTCAACAG 4 38 0 

CAGTTGCACA AATATCAAAA TAAATTAGCG AgTTGATTGA AGAATATGAG CAGTCTAAAA 4 4 40 

ATAAAGATAC TGAACAGTTA TATGGTGAAT TGATCACTGC TAATATATAT CGAAT T AAGC 4 5 00 

AAGGCGATAA AGAAGTGACG GCATTGAATT AT T AT ACGAA TGAAGAAGTT GTCATTCCTT 4 5 60 

TAAATCCTAC AAAATCCCCA TCAGCAAATG CTCAATATTA TTATAAACAA TATAAyCGTA 4 62 0 

TGAAAACGAG AGAmCGTGAA TTACAACATC AAATTCAATT G AC G AAAG AC AAT AT AG AT T 4 680 

ATTTTTCAAC AATCGAACAA C AAT T AC AT C ATATTTCTGT CCATGACATT GATGAAATTA 47 40 

GAGATGAATT AGCAGAACAA GGCTTTATGA AJCAGCGTAA AAATCAAACT AAGAAAAAGA 4 8 00 

AAGCGCAGAT TCAATTACAA CATTATGTAT CAACTGATGG CGACGATATA TATGTTGGTA 4 8 60 

AGAATAACAA GC AAAAT GAT TATTTAACAA ATAAAAAAGC TAAAAAAACT CACACATGGT 4 920 

tACACACAAA AGATATTCCT GGTTCACATG TCGTTATATT TAATGATGCA CCAAGTGRA 4 980 

CGACAATCAA GGAAGCGGCT ATGTTAGCAG GATACTTTTC AAAAGCTGGT AATTCTGGAC 504 0 

AAATACCTGT TGATTATACA TTAATTAAAA ATGTGCATAA ACCATCaGGT GCAAAGCCTG 5100 

GGTTTGTAAC ATATGACAAT CAAAAAACTT TGTATGC 5137 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

GTTTTATCGC AGCAGTAAAG CTATCAATCG GCGGTTCAAT TGATGATGCA T T AGC AG AAA 60 

T C An AC AAT C ATTTTAGTTA AAATTTACTA ATAATGAaAA ATGTAAACCT TTTTCAAATG 120 

AAACTTTATa AaAAATATGA TAGTATATAT GTAAATGTTT AATAAAATCT GGAGAAATAG 180 
GAGGACATTG CCATGCAACA CCTTATAAAA AAACffGTAT TGAATGGCGA GTTTGATTTA 240

GTACGACAAT TGATGTCCGA AACAGATTTT ATGGAATTTG AAGAAGCATA TATTTCAAGT 30 0 

GCGCATGAAG TAGAAAGTAT GATGTTTTAT ACATGTATTT TAGATATGAT TAAGTACGAA 360 

GAATCATCTG AAATGCATGA CTTAGCATTT TTATTGCTTG TGTATCCACT AAGTGAATAT 4 20 

GAAGGTGCTT TGGATTCTGC TTATTATCAT GCAGACGCTT CC AT AAAACT TACTGACGGC 480 

AAAGAAGTTA AAAGTTTGTT ACAAATGTTA TTATTGCATG CGATACCAAC ACCTGTTATT 54 0 

TCAGATAAGA AGGCTTTTGA TATCGCCAAG CAAATTTTAA AATTAGATCC TAATAATAAT 600 

GTTGCTCGTA ACGTCTTAAA AGACACTGCC AAACGTATGc gACAaCGTTG TTGTTGATAT 660 

AAATGAATTA CACCAACGTA ATGCACGTTA AT T AC AT T T C AATTATATTA GCTTAATAAT 72 0 

AGTTTTAACA TTTGGTTGGG TTGGGCATAT GTTCCAGCCT TTTTTAATAC TTAAAAACTA 780 

ACGAAgTATA CTTGTGTGCA CAAATGGTTT TTATACAACA TTTATAAAT T TAT AC AT T T 840 

TAATAAAGAA CATACGATAG ATGGTTTAAA CCTTGTTAAC TGAGAAATTT T GAT AT G TAT 90 0 

TCTTCGAAAT TTAACTAAAT ATACGAAATT CAAGAAGCAC AATAATTAAT CATTTTTCCT 960 

ATACAAAAGT TCGTATGACT GC ATT AT AAA AGCATAAATT TATAATTTTT TTAAATGTCA 102 0 

TTGAACGTGA TAATGTGAAT GGATTGAGCA ATTTTGAAAA AG T G AAAAAT AACCTATGCG 108 0 

ACTTGCAATT AATTTTCAGT ACGTTATAAT GCACACTGTG CAAAATTAAG GAGGTCTATT 114 0 

ATTCACATGA TGATGAaTAA AGAAGCAACA AAAATTGGAT TTGCCTACGT CGGCATTGTA 12 00 

GTGGGCGCAG gATTTTCAAC TGGATAAGAA GTTATGCAAT TTTTCACTAA ATATGGCTTG 12 60 

TGGGCTTATT TAGGTGTTAT TATATCTGGT TTTATTTTAG CTTTTATTGG GCGCCAAGTA 132 0 

GCAAAAATTG GTACTGCCTT TGAAGCGACA AATCATGAAT CAACATTACA ATACGTATTC 138 0 

GGTGAAAAGT TTAGTAAAGT CTTTGaTTAT ATTTTAATCT TCTTCTTATTTGGTATAGCT 14 4 0 

GTAACCATGC tAGCTGGTGC AGGCGCAACA TTTGAAGAAA GTTATAACAT ACCTACATGG 1500 

CTAGGTGCTT TaATTATGaC ATTAGCGATT TATATTACGT TGCkATTAGA CTTTAATAAA 1560 

AT AG T AC G T G CACTAGGTAT CGTTACACCA TTTTTAATTG TTTTAGTTGT ATTAATCGCT 162 0 

GGCGTTTATT tATTTAAAGG TCATGtTTCA TTAGCAGAAG TTAACCAAGT AGTGCCtGAA 1680 

GCAAGTATTT GGAAGGGAAT CTGGTTTGGT AC AAT AT AT G GTGGATTAGC TTTTTCTGTA 174 0 

GGTTTTAGTA CCATCGTAGC AATCnGTGGG GAT AC T G AAA AGCGTACAGT GTCAGGTGCA 1800 

GGCGCGATGT ATGGTGGTAT TATCTATACT Q?ATTACTAG CAT T GAT C AA CTTTGcATTG 18 60 

CAAGTGaATA TCCAACTATT AAAAATGCCT CAATTCCTAC ATTGACGTTA GCAAATAATA 192 0 

TCCATCCTTT AATAGCAACA GTGkTATCTG TTATTATGCT GGCGGkTATG TATAATACTA 198 0 

TTCTAGGACT AATGTATTCA TTTGCAGCAC GTTTTACAGA AC CAT AC AG T AAAAATSTC 204 0 
ATATCTTTAT TATTATAATG ATGGTAGCAG GTTATTTATT AAGTTnCGTA GGATTTGCTG 2100

AATTAATTAA TAAGTTATAT ACnATTTATG GGATATGTAG GCTTATTnTA TTGTAGTAGC 2160

TGTAATTATn AAATATTTCC AAACGTAAAA ATGGCGGATA AAAAACATAT TGCTTTAATA 2220

TCATATGGAG GGGATATCCG AAACTTTACA ATTTGAATCA CTTTGGT 2267



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

GTCAATGTAA CCTAATAGTT TATGTCTATC TTGTGTACCA ACTACTACAT CGACACCAGG 60 

AATTTCCATA ATTTCAGCTG ATGAAGTTTG CGCATAACAA CCTGTTACAC AGATTACAGC 12 0 

ATCAGGATTT TGTCTTATTG CACGTCTAAT TATTGACGA CTTTTTTTAT CACCCGTATT 180 

CGTTACTGTA CAAGTATTAA TAACAAATAC ATCAGCATTC GCTTCAAAGT CAACGCGCTC 24 0 

ATAGTTTGCT TCTTTAAATA ATTGCCAGAT TGCTTCAGTT TCATAATGGT TTACTTTACA 30 0 

ACCTAATGTG TGaACGCAAC TGTTGACATA AATAT TCACC CCATTAATTC TTTTTCATAA 3 60 

CTTATTGCAC TTAACGCATA CAATGGCGCA GTTTCTGCCC GTAAAATTCT CGGCCCAAGA 42 0 

CCAACAACTG TACTAGTATT ACTAAATAAT GAAATTTCAT TTTCTGACAA ACCACCCTCA 4 80 

GGaCCAAAAA TCATCAACAC TTTATCCTGA GCATTGAATT GTTGTAAAGT TTGCTTGAAA 54 0 

TTGCTTAACT CACCATCTTT TGCTTCCTCT TCATATGCAA TAAGAATATA GTCATAATTA 600 

TCAATAGTAT CACAAATTAA TTTTAAATTC GACTCGAATT GAATAGATGG AAT C AC T AAA 6 60 

CGATAGCTTT GTTCAGCAGC TTCTTTAATT ATTTTTTGCC AACGCTCTAT CTTTTTGGCA 72 0 

ACTTTTGCCT CGTTTAATTT AAC AAT T G AA CGTTCCATGC EACAGCTAT AAATGATGAA 780 

GCACCCAATT CAGTAGCTTT TTGTAGCAAC CACTCATATT TGTCAGCTTT GATTAGTCCA 84 0 

CTGCAAATCG T AAC AT C AAC TGGCAATTCT GTATTAATAT TTTGTTTTTC TTTTAAATCA 900 

ACTTCAATTT TATCACTTGT TATGTCAGCA ATTTCACATA AATAAACTGT TTGATCATTA 960

AAAGTTAAAA TAATTTTACT AC C AAC AT C A TATCTCATTA CATTTGTTAT ATGATGAATA 102 0 

TCTTCTTTTT TTGTAATAAA AAAACGCTGA C T T AC AT C AG CGTTTTGGnT CTATGAAATA 1080 

ACGTTGCACA TTATTCACTC ACTTTCTGGC CAACAAGACA AACCCAACCG TTGTCATGTT 114 0 

GTTCTGAAAT AATTTTAAAA CCTACACGCT CCATATGTGA CTGTATACCT TCATACTTCT 1200 
CTTTTATAAT ACCAGAAGTA ATAAAATAAC CGCCTTCATT TAGAGTATTA TAAGCATCTT 1260

CAATCATTTC ATCAATAATA TGCGCTAAAA TATTTGCTAT TACAATATCA AATTTTTCTG 132 0 

TTTCGTCTTT CAATAAGTTA CCTGGAACAG CTTCAATTAA CGTTTCACA TGATTTCTTC 138 0 

TGAAGTTTTC TTTAGCTACA CTCACTGCCA TTTCATCAAT ATCCAACGCT TTAATACGTT 14 4 0 

TTACACCGAT TAG AT G AC T T GCAATACTTA ATATACCTGA GCCAGTACCA ACATCAATTA 15 00 

CTGAATGCTG TGGCAATACA TATGTTTCTA TTGCCTTCAA AC AC AT AC T T GTAGTCGGAT 15 60 

GATCACCTGT TCCAAAAGCC ATACCTGGGT CGAGCTCAAT GCAAAGCTCT TCATCCGCTT 162 0 

CTTTAGCATA TGTTTCCCAA CTAGGAACTA TTGTGAACTT CTTCGACGCT CGGAATGGAT 1680 

GGAAATAGTT TTTCCATTCA TTTTCCCAAT CCGTCTCTGC AATAATTTGC TCACTGAATT 17 4 0 

GAACGTTATG TTGATCAAGT TCATCTAAAT TTAATAACTC ATCTTTAATT TGCTGTCGCA 1800 

ACT TAT CATC ATAAGTCATT TCATTAAAAT AGGCTTTCAA TCTTACTCCC TTATCTGGAT 18 60 

AATCCTCTTT TTTCAAAGCG TAAATTTCAC CGTATTTATC TTCTGGTTGG TTAATTAAAT 1920 

CATCTGAATC TTCTATCACG ACACCATTTG ATCCATGATT TTCAAGTATA TTGGTQCCA 198 0 

ATTCTACTGC TTCATGATTA ATAATAATTG AAAGCTCTGT CCAGTTCATA CTTTATTCTC 20 4 0 

CCTTAAAGAA TCTTTTTGCT CTATCTTTAA AATTCGAAGG TTGTTCATTA ATTTCTTCAC 2100 

CATTTAATTG GGCAAATTCT TTCATTAGTT CTTTTTGTCT ATCTGTTAAT TTAGTAGGCG 2160 

TTACTACTTT AAT AT C AAC A TATAAATCTC CGTATCCATA GCCAT GAACA TTTTTTATAC 222 0 

CCTTTTCTTT TAAGCGGAAT TGCTTACCTG TTTGTGTACC AGCAGGGATT GTTAACATAA 2280 

CTTCATTATT TAATGTTGGT ATTTTTATTT CATCGCCTAA AGCTGCTTGT GGGAAGCTAA 234 0 

CATTTAATTT GTAATAAATA T CATC AC CAT CACGTTTAA TGTTTCAGAT GGTTTAACTC 24 00 

T AAAT AC T AC GTATAAATCA CCAGCAGGTC CTCCATTCAC GCCTGGAGAG CCTTCACCAG 24 60 

CTAATCTAAT TTGTTGTTCA TTGTCGACAC CTTCAGGTAC TTTCACTTCT AATTTAACTG 2520 

TTTTATTTTC AGTACCTTTT CCGTGACATG TTGGACAAGC TTCTTCAAAT TCTTGACCAC 258 0 

TTCCATTACA TTTAGGACAA ACTTGTTCAG TACGAACTCT ACCTAAAATT GTGTTTTGTT 2 64 0 

CTACAGCTAC ATGACCAGCG CCATTACAGT AACTACAAGT CTTTTTACTT GTTCCAGGCT 27 00 

TTGCACCATC ACCATGACAT GTTTCGCATG TTACATCTTT ACGGATTGAA ATTTCTTTTG 2 7 60 

TTGTACCAAA TACCGCTTCT TCAAATGTTA ATGTCATTGT ATACTGAAGA TCATCACCTT 2 8 20 

TTTGCGGTGC ATTTGGATCT CTTTGTCTGC CGCCACCGAA GAAAGAGCTA AAGATATCTT 28 80 

CAAAACCGCC GCCACCGAAG CCACTAAAAC CGCCAAAGTC AGAGCCATTG AATCCTTGTC 2 94 0 

CACCAAAACC TTGTGGACCA TCATGTCCAA ATTGATCATA GcTTGGCGT TTATTATCAT 3000 

CACTTAAAAC TTCATAGGCT TCAGAAATTT CTTTAAACTT TTCATCTGCA CCTTCTTCTT 30 60 
TGTTAATATC TGGATGATAT TTTTTCGAAA GCTTTCGATA CGCTTTTTTG ATTTCATCTT 3120

TTGAAGCATC CTTACTAATG CCTAAAACTT CATAATAATC TCTTTTGGCC ACAGCTATCT 3180 

CTCCTTTTCT TAATTAACTC ATATAGTTTA ACGTAATATG TCATACTATC CAAATAAAAA 32 4 0 

GCCAAAGCCA ATGTTCTATT GACTTTGACT TTTCAGATCA TGACAACATT CTAATTGTAT 3300 

TGTTTAATTA TTTTTTGTCG TCGTCTTTTA CTTCTTTAAA TTCAGCATCT TCTACAGTAC 3360 

TATCATTGTT TTGACCAGCA TTAGCACCTT GTGCTTGTTG TTGCTGTTGA GCCGCTTGCT 34 20 

CATATACTTT TGCTGATAAT TCTTGAATCA CTTTTTCAAG TTCTTCTTTT T TAG AT T T AA 34 80 

TATCTTCTAT ATCTTGACCT TCTAAAGCAG TTTTAAGAGC GTCTTTTTTC TCTTCAGCAG 35 4 0 

ATTTTTTATC TTCTTCACCG ATATTTTCGC CTAAATCAGT TAAAGTTTTT TfiACTTGGA 3 600 

ATACTAGACT GTCAGCTTCG TTTCTTAAGT CTACTTCTTC ACGACGTTTT TTATCTGCTT 3660 

CAGCGTTAAC TTCAGCATCT T T T AC CAT AC GGTCGATTTC TTCGTCTGAT AATGAAGAAC 372 0 

TTGATTGAAT TGTAATTCTT TGTTCTTTAT TTGTACCTAA GTCTTTTGCA GTTACATTTA 3780 

CAATACCGTT TTTATCGATA TCAAACGTTA CTTCAATTTG AGGTTTACCA CGTTCAGCTG 38 4 0 

GTGGAATATC AGTCAATTGG AATCTACCAA GTGTTTTATT ATCCGCAGCC ATTGGACGTT 3 90 0 

CACCTTGTAA TACGTGTACA TCTACTGATG GTTGATTATC TACTGCTGTT GAATAGATTT 3 9 60 

GAGATTTAGA TGTAGGAATC GTAGTGTTAC GTTGLATTAA CGTATTCATA CGTCCACCTA 4020 

AAATTTCAAT ACCTAAAGAT AGTGGTGTTA CGTCTAATAA TACTACGTCT TTAACGTCAC 4 080 

CTGTGATAAC GCCACCTTGG ATTGCAGCTC CCATTGCCAC TACTTCGTCC GGGTTTACTC 414 0 

CTTTGTTAGG CTCTTTACCG ATTTCTTTTT TGACAGCTTC TTGTACTGCT GGAATACGAG 4200 

TTGATCCACC AACTAAGATA ACTTCATCGA TATCTGAGTT TGTTAAGCCA GCGTCTTTCA 4 2 60 

TTGCTTGGCG TGTAGGTTCC ATTGTTCTTC TAATTAATGA ATCTGATAAT TCTTCAAATT 4 32 0 

TAGAACGAGT TAAGTTTACT TCTAAGTGTA ATGGACCGTT TTCACCAGCT GAGATAAATG 4 38 0 

GTAATGAGAT TTGAGTTTGT GAT AC AC C T G ATAAGTCTTT TTTAGCTTTT TCAGCAGCAT 444 0 

CTTTCAAACG TTGTAATGCC ATTTTATCTT GAGATAAGTC TACGCCATTT TCTTTTTTGA 4 500 

ATTCTGCAAC TAGGTAGTCA ATAATTACTT GGTCAAAATC ATCACCGCCA AGTTTGTTGT 4 5 60 

CACCGGCTGT TGATAGTACT TCGAATACAC CGTCACCTAA TCTAGGATA GATACGTCAA 4 62 0 

ATGTACCGCC ACCTAAGTCA AAAACAAGAA CTTTTTCATC TTTATCAGTT TTGTCTAAAC 4 680 

CATATGCTAA TGCTGCAGCT GTTGGTTCAT TAATGATACG CTCAACTTCT AAACCAGCAA 4 74 0 

TTTTACCAGC ATCTTTAGTT GCTTGACGTT CAGCATCGTT AAAGTATGCA GGTACTGTAA 4800

TTACAGCTTT GTCAACTTTC TCACCTAAaA TAgTTTCAGC TGTATTTTTT AAGTTTTGTA 4 8 60 
AAATCATAGC TGAGATTTCT TGTGGTGTGT ATGATTTACC TTCAATATCT ACTTTATAAT 4920

CAGTACCCAT ATGACGTTTA ATAGATTGAA CAGTGTTTGG GTTTGTAATA GCTTGACGTT 4 98 0 

TTGCTACTTC aCCAACTTGA GTTTCTCCAT TTTTGAAAGC TACAACAGAT GGTGTTGTAC 504 0 

GTGAACcTTC AGGGTTTTGA ATTACTTTTG GCTCATCGCC TTCTAATAcT GTnACACATG 5100 

AATTTGTTGT ACCTAAGTCT ATACCAATAA TTTTACTCAT AATAAAATTC CTCCATTTAA 5160 

TCATTAAATT AATTTAATTT TAAACAATGT CTTTTCGCCA AATTTAAGT ATTGGTTTAC 5220 

TTTGACCATT GATGGTCTTA ATACTCTATC TTTAAGCTTG TATCCTTTTT GTAGTTCTTG 5280 

AGTGATTTCG CCAGATTCAA AATCAGGGTT ATCATCTTGA ACTACAGCTT GGTGAATATT 5 34 0 

TGGATCAAAT GCTTCACCTT CAGTTTTAAT AACTTCAAGA CCATTATCTT TTAGTGCGTT 54 00 

AATCAAACTT TCATGCACCA TTTGTACACC TTTTTGAAGA GATTTAAAAG TCTCATCATC 54 60 

ACCTTCAATT TGAAGTGCAC GTTCTATATT GTCTATTGCT GGTAAAATAT CTGTTAACAC 5520 

ACGTTGTGCT TGATATGTTT TGTTTATTTC ATTTTCTTTT TGAATTCTAC GCTTATAATT 5580 

TTCAAACTCA GCGTAGAGCC TTAAATATTT CTCTTCGTTT TCATCTGCTA ATTGTTGAAG 5 64 0 

TTCATTAATT TTTTGATCTT TTGGATCTAT TTCTTCAATA ACATTCTCGT CAGACGTTTC 5700 

TTCTATTGCT TCATCTTGTA AATGACCTTT ACTTTCTTCA GCTTGTTCAA CTGAATCATC 57 60 

AATATTTTGT TTGACGTTTG TTTCTTCAAC TGTTGATTCA GTGTTTTTTT CAACT&TTC 5820 

GTCTTTATTT GTCATTTTCT GTCCTCCAAT ACTTTCTAAT CCATCATTAC CAAATTCTAT 5880 

TTAATAATTG AATGACATTT TGATAATGCA TAGCTGTAGG TCCAATCACA GCGATTTGAC 5 94 0 

CTTTTAACGT TTCATCAAAA TGATATTGAC TTGTTACAAT TGAAATATCA CTTAAGCTGT 600 0 

CATCAATTTC ATTACCAATT TTTACATTAA TATTTGGTGA AGATATATCT TGTAATAATT 60 60 

CTGCAATTCT ATTTGATTCT ATATATTGTA GAATGGGCTG AATTGAAGAT ACATTACTTT 612 0 

CATTCAATGC ATCAATAAGT TTAACCTTTC CACCCATATA AATGCTATTA CTTTGATTAG 618 0 

AAATATGATT ATTCATCGTA TTTAACAATT TATTGATMA AATTTCTTCC TGCTCTGATT 62 4 0 

GAaCAAAAGA GACAATATCA TCTTGTAAAT TCTGATTAAA CTCAGTTAGT TTGTTTGTAA 6300 

CAAAATTTGA TATTGTATTT AGTTTGTCAT TATTAA 633 6 
(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 
TTCATGATTA TTATCTGTTG TAGACACTGC TGGATCTTCC GATGTATCTT TCGATGCEC 60

TTTCGATTTG TGTATTTGCT GATTCAAATG GTCTAGGTCT TCTAACGCCT TATTTACCAT 12 0 

TGCTTCATCA TTTTTATCAT CTTTTTCTCC ATGTTTTGTT GTAGCCGTTT GTGACATATC 18 0 

ATTTTTCATT GCATTAAGAT CGTCCTCGCC ACTTTGTTGA CCCCTATCAA CATTTGAAGA 24 0 

AACCTCATTT AAATCTTTAA GCAATTGATC TAATTTACTG TCTATATCAC TTTGACCGTT 300 

CATTTCAGTG TGAGAACTTT TATTGTCTTT GCTATCCAAC TCATTAGCTC GTTTTATGAT 3 60 

TTCATCTATT TGCGATGCTG TTTTCGCTTC ATTTAGTTGT GCTTTATAAT GTTCTTTAGA 420 

TGAAGCCGAT AACTGTTTTA ATTGCTCAAT TTGACGAATT GCCTTGTCAA CTTTGTCTAA 4 80 

TAAATCTTGC T T AG AT AAT A TCTCTTTTGT AATTTCAGTA TCCTTTTCAG ATGCAGCTTG 54 0 

GGCATCGTAC GGCAAGATAT TCGTTAAAAT GATACTTGTC GCCATCATTG TCGAACACGA 600 

TAACTTTACA TATAATTGAA ACGGTTTCCC TCGATATTTA GCCATCAACA TACTCCTTCC 660 

TCACTTACTT CCTTCAAAGA ATTACATACT ATTATATACC TGTTTACAAG AAATTTACAC 720 

TTATCTATCT AGTTATTGTT GTTAGTAATT ATCTACTTAT TACTTAGCTT ATATTTAAGT 7 80 

AAACAAAACA AGCATGACGT AATATCATAT TGTCCATGTC GCTAACATCA TATTACGTCA 840 

AATCTTTTAT ATTAAATGAT GTTTTATTTT AGACTGCTTT TTCCTTTTAG CTTTCGAGCG 900 

CCTGTTTAAA AACTTGCTCG AATTGTTCAC GCGAGATTTC GTGTGCATGT GCTTTTTGTG 960 

CTAATAAAGC ATCTCGAAAC TGTTGTTGAT CTTTCAAACT TTCTAACATT TGTATTAATT 1020 

GGTCTTTACT TTCCATTGTT ATCTCATCAT TATGCTCAAA TAAGTGTCT GATAATGTTA 1080 

CTTTAGCATG GTGTGCGGTT TGACGATAAC CTAAAATCAA CAACTCATAG TCAAACGCTT 114 0 

GTTCCACCGC ATTTAAAATT TCATTACCCT CATTGATATC AAGATAAATA T C AC AT AAC T 12 00 

GGTATAGTTC ATTTACCCTG TCAATATTAA TAGATGGGTA TAAATGCACA TTAGCATATT 12 60 

GATCAAGTTG CATTAGCTTA TCAGACATCT CTGTAATAGC AGCGATGTGn AACTTAAAAT 1320 

CTGGTAAAGt TyCAACCAAT ACCTTGATGT TACGAatTGa TCCgAGTTAG T T AAT AT T AC 138 0 

AATTTCTTTA GTATATCTAT TACGACTACG ATAGTTATAT AGATATCCGC CTTGTAAAAT 1440 

ACGAGATTGA ACCTTTGCGT CTGCTATATT GAGCATCGTT TCATATTCGT TTTTATCTGG 1500 

AATAATAATA TTACAATGTC GTTTCATATC ACCTTTACAC ATCAATTGCA TATTTCCCGG 1560 

GACATTACCA TTACAGTGTT CTTGCCATAC C AAAAC AT C A CTACCTTTTG ATGGCAAATT 1620 

ATATAACACT GAAAATGGTA GGGCTAGTGA GTTAATAACG AAATGATGTT CCGSATTTC 168 0 

AAGTTGCTTG ATAAAAAATA ATGCGAATGC GAGCTTTGAA GGGAAAAAG T AAGACTTCCC 174 0 

TTGCCAATCC AAT AT G AC AT CAGATGTTAC AAAATTTTCA TAAATCACTT CTTTACCTTC 1800 
TGCTGTCATA TATTTCTTCA AGATCGCTTT ACGATTTAAA TCGTAAACAG TTTGTGCAAA 1860

TTTAATACCA TTCTTAGAAT AATAATCGAC AAATCGGACA CGTTGTTGGT CATCAAACCA 1920 

TTCGACACGA CTAACAATTC TAGGGCGCTC TCCACTTTGA yAAAATATTT TACCTCGyAG 1980 

ACGTCCCATA TCATTaATTG TAGCCGAATT GTTGTTACCT TTAATTTCCC AAAAAGCTGG 20 4 0 

TACAGTAACC T GAT T AAAAA ATCGTGGTTT CATATTTCT GTATTATGAT TATCTGCAAA 2100 

AAATTGATAC GGTGATATAA CATCGTCCGG TAAAAAGCCA TTGTCATTGA GTACAATTGT 2160 

TAAATCTTCT TCCAACTTAC TGGCTTTAAA AGACTCATAT AACTTTCGTG AATGATCGTT 22 2 0 

AAAGTAATCA AATAATTTAA TCATGTAGCA CCTCTTGaAC TAATGTTTCC CATTTTAAAA 228 0 

TAATATCTTG AGTCATAAAT TGCTGTGCCA CTTCATAAGA GATGTCATGT GGTGTCTGGG 23 4 0 

GACCATTGTT AAAAT AC AT T ACAATGGcAT GAGCTAGTTT TGCGATAACA TCATCCACAC 24 00 

TATCTTCGTC GGTATCAAAA GGTACCAAGT AGCCATTTTC CCCATCTCGA ATAAAGGTTG 24 60 

GGTTACCATA ATT C AC ATT T AATCCAATCA TACCTAGTCC TGAGCCTACC GCTTCCATTA 2520 

GTGTTAACCC AAAACCTTCG CTAGTTGATG CAGAAAGAAA TAACTCATAA TCATTATAAA 258 0 

TTTCATCAAG TTTAACATGC C C TAG T AAAC GAATATAATC TTGTGCGCGG TGTGTATCAA 2 64 0 

TAATTTTACG CAGTCGCGTC TTCTCGCTAC CTTCTCCATA AABTCAAAT GTTAATTCTG 2700 

GCACTTGTCG TTTAGCCACG ATAACCGCTT TGACAAGCCA ATCAATATGT TTCTCATTCG 27 60 

C T AAAC GAGA TGCACTAATC ATCGCATATG GCTTTCTTGA TAATGTTGGA TATGATAATG 2820 

CATCAATGCT TCCCACAGGA ATAGTATAAA CACGTGGGCG ATAACCTTGA TATTGCTCAA 28 80 

ATTGTCGACA AACCATATGA TTTTGAATAT CTGTTGCTGT AATAAAGAAA TCAATGTATT 2 94 0 

TAGCTTTTGA AAAT TGATAT TCATAATAAT TGTTCCATAG TATATGCTGC T C AC T CAT C A 3000 

TATTATTACT AT AAT GAT C A GCATGAATCA CAACACCAAC TTTACTATCA CCTTTATGCT 30 60 

GCAAAACAGC CTGACCAATA TCAG^AGCGC GGTCTAATAT GACAATATCG TCTCGGGTTA 312 0 

AATTCAATCG TTGTAAAAAG TATGCAATAA ATTCCGTTTT GTTATACAAC ACCGCATCTT 3180 

CAAACACATA TATAGAGCTG TCTCCATCAA TATATTCGTT ATAAGCGATG GAACCATCTT 32 4 0 

GATTATAAAA TTGTCGCATA TATAATTTCG CTTTATTATC AGCTGGTGCA TAATACTCAG 3300

AAAATATGCG CGTATAACTA TAAAAATCTT TACGTACTAA CATACTATTA ATTACAAATT 33 60 

CTGCACGATC CACAATATCT TTTTGTTCAT TTTGCAGATA ACATGTTACA AAT GAT GAT T 34 20 

TCCCATTAAA ATATAGGCGG ACTATCTTAC CATTTCTTTC TCTAAAACTA ATGTCATGAC 34 80 

CAAGCTCACG TTCAATGTCA TCTAACGTGT ACGTTGTTGG TGCTAAAGAA ATATCACTAA 35 4 0 

AAT AC T GAT A CAACCAAATA ACTTCTTGAT CTTTAAACCC AATGTTTTGC GTTAATGTCT 3600 

GTATGTTCTC TGACTGTATA AAATCTAAAA AC AC AAAT T T AGTGTCTTGA TTTGTACGTC 3660 
TCAATAATTT AGCACGGTAA GCTTGTGCAT ATCAACACC GCTACTCGCC CAGCCTATAC 3720

CAAAGTTTAT ATTATATATT GTCATGCGCT ACCCCTTTTC ATTTATGGAA AATGTATAAC 37 8 0 

TGGCATACCC TCTTTATCAA ATGTAATCAT GCTTTGACAA ATATTTTTCA CCATTCTTTT 3840 

TTTGATATTT CGTGTCATAA CTTCAAATGA ATCTAAGGCA ACTCTATGGT ATTCAAA&T 3900 

AGCGTTACGT TGCCCATTTT GTCGTTGATT AACGCTTGCT TTTAATTGTT GTAAATAGTC 3960 

GACTTGTTCT AACCAACATG AATCAATTGC TTTCAAAAAG ACTTTTTGAA C GAAAAT AT T 4 020 

ATAATAATAT GCACTTTGCA TGTTTTTACG ATTCAAAGCT AATTGCTTTT CAAATTGCTC 4080 

TAATAAAAAT GTC2CTACTG CTTGCTTATC TTTAAAATTA ACACAAGCCA CATCTTTATT 414 0 

AAAT T GG AAA CTTAAATTTT GATAAATATA CTCGACAACA CGCGATTTTG TTAGCACCTT 4200 

TTCCTCATTT ACAAACATTT C AAAT AC AT C TTTAGCTAAC GCTTTAAAAT CTTGATTCTC 4 2 60 

AGCATCATCT ATTTCTAAAA CTCGATTGCG TTCCTCGTAT ACAAGATCTC GCTGTATACT 4 320 

AATGCTTTTT TCAAATTCAT TAGCCATTTC ACGAGCTTTA ACCCCTTGTT CTTCCGAGAt 4380 

aCGcTGCGCT TTAACTACAA TTTGCTTAAC TTTGCGATTA AACAAATTAC TTTGCGATAA 44 40 

TCGTTGTGCA TCTAATGAAT ATAATTGATT ATTTTCCGCT AAATTACTAT CGCTCCATCG 4500

CTTAACTAAA TAATCATCTA GTGAAATATA TATACAAGAT GATCCCGGAT CCCCTTGTCT 4 5 60 

ACCAGAACGA CCACGTAATT GCCTGTCTAC ACGGCTATTT TCCATATGTT CATGAATAAT 4 620 

AACAGCTAAT CCACCTAATG CTTCGACACC TTCACCAAGT TTAATATCTG TGCCTCGACC 4 680 

TGCCATACTA GTCGCAACAG TCATGGAACC AATTTGCCCT GCTTCAGCTA TCATCTGCGC 474 0 

TTCTTTTGCA ACATTTTGCG CAATGAGTAA ATTATTAGGA ATATCCATTT GGAATAATAC 4 800 

TTTCGAAAAG TATTCAGCCG CTTCAGCAGT TCTCGTTATG AGTAAAACCG GTCGCCCCGT 4 8 60 

TTCATGAAGT TCAACTATAT CATGAATCAT CGCGATGTTT TTCTCACAA CTGAACGAAA 4 92 0 

CACTTTATCT GGTTCATCGA TACGTTGAAT CGCTTTATCA GTTGGTACTT GTACGACTAT 4 980 

TTTTGAATAC AAATCAAAGA ACTCTGATTC GCCTAATTTT CCTGTAGCTG TCATACCTGA 504 0 

AAATGATTCA AAAAGTTTAA ATAAATTCTG GAAGGTAATT GTTGCCATAA CACTTTTATC 5100 

TGTTGAAACC TCCATACCTT CTTTCGCTTC AATAGCTTGG TGAAGTCCAG CTTGCAACTT 5160 

AGTTCCCGGT AACATACGAC CTGTAATACG GTCAATTAAA ACAATATCAC CATTATATAC 5220 

AAAGTAATCG AC AT TAG AT T CAAACAAATA TTGTGCGCGC AGTGCTAAAT TAATATTACG 5280 

CACTAGGACC ATCGCTTGTT CGCTATATffi. ATCTTCAACA TTAAAGTATG ATTGTGCCGC 534 0 

TTCAATACCT TGATTTAACA GCCATATTTC TTTTTTGGTC TTCTTCATTT TAAAATGCAC 54 00 

GTCTTCAATC AATGTATCTA CAAACTCTTT CACAATATGA AATAGATTTG ATTGTAATCT 54 60 
TGGTGCACCC GAAATAACTA ATGGTGTTTG AGCAGCATCT AAAATGATTG AATCACTTC 5520

ATCAATAATA CCGTAATTTA ATTGTGGTAA AAATTTCCCT TCCGCACTAT CAGCCAAATT 558 0 

AT C AAT T AAA TAATCAAAAC CGAGACGTCC ATTAGTTGTA TATATAATAT CATGTTCATA 5 64 0 

TATATTACGT TTTTCCCCTT TTTGATACTC ATAATCCACA ATATCAACAA AACCTAATGA 57 0 0 

AGCAGTTAAA CCTAACCATT CATATAATGG TTGCATCTCT TCAAAATnCc gnTCGCTAAA 57 60 

TAATCATTCG TTGTAATTAA ATATGTTCCT TTTCCCGAAA GAGCATTTAA ATATAAAGGC 5 820 

ATCGTTGCCG TTAATGTTTT ACCTTCGCCT GTTTGCATCT CCGCAATGTT ACCTTCATGC 5880 

AATACAATCG CTCCGATTAA CTGAACTTCT TTAGGAACA TACCTAATAC TCTCCAGCTC 594 0 

GCTTCACGTG CCACTGCATA AGCTTCAGGT AACAATGTAT CTAGTGTATC AACTCCTGAT 6000 

GCTAAACGTT CTTTAAATTC TATTGTCTTT TGTTTTAACG CATCATCAGA ATATGATTTA 6060 

ACTTCATCGC TCCATGTATT GaTGsGTTcA CTATTTTTCT AATCGACTTT AGTCTTAATT 6120 

CGTTTATCGT AACATCTAGT TTATGTTTCA TTTACTTCCC C AC C AT T C AG TTTCGATACA 618 0 

TCTAAGTAAT CTAAAAATCG TACTGGATTC ATTAAACGTG ACATATAATT TAGATGTTTG 62 4 0 

TCTTGCTCTT CTTTAAAATA AACCTCGACA TTTGTATCTT TTAGTTCATG ATTTCCTGGG 6300 

ACATGTTCTG TAAGCCATCC TTTTAAATCA TCATCTTCAT GGCTTGTACG ATACACTTTG 63 60 

CAACCCAAAT GCTGAGCGAC ATAAGTTGCA AAAACATTTG ACTTTGACCC ATAACTAATC 64 20 

AAATTAATAG CCTTTAGGGT ATCTTGACTT TGCAAATCAT TCTTTAGTTG CTTAATATTT 6480 

CCCTCGATAT TGTCGTCCAT CCAACGTTCA AC GAG C C AAA CAEACCAAA CAGTTTCAAA 65 4 0 

AAATCATTCG AAATAGTTGG ATAGGTGTCA GATGGTTCTG CAATAATGAC ATT GAT CAT A 6600 

TCATTTCCAT ATTGGTCATC GCCTATCTTC GTCACCCGCA TGCTTTTATA CTCTAAATCA 6660 

TATTGATGCG TCATCTCTGT GATTGTTAAA CATCTAAATA TAAGACTCGT CGATGCTGCA 6720 

TTCATCATTT TTATTTTATA AGCATAGGCk TCATCAGGAT ATTGAATCGT AATACTATTT 6780 

GACTTTACAA TCTCAGTACT TAGTTTTGTG CCATTTTTAT TATAAAAAAT GATGATAAAA 68 4 0 

TACACTGAAC CAGCAGGCGT TGCATCAAAA TCAAAATGCA ATT TAT AAT G CTGTCCTCTA 6900 

CGCAAAATTG GkAAACTTGG CGcACTTTTA TATTTTGAAA ATTGCTTTAA CATCAACCAC 6 9 60 

TCATGAATCG GTAATCCAGA GGGCATCAAA GGATTTATAA AAGTCACTTC ACCATTTGAA 702 0 

AATGATACTT TAGAGCCATA CATAAATGTA GTTTGTGAAA TATAATTCCA AGTAACTTTA 7080 

AATGTTTTGT TTTTCAGCAT GTTGAACTCT CCCAAACTTG TCTTCCAAAA TAATGTTGTA 7140

AAAATTAACA AACCAACTTG CAATGGTAGG TGAATCATCA TTATGTCGCC CAGGAATACT 7200 

GCGATTCATC ACTCTTGCTT GGTGTGCTGT CAATACAGGT AATAGCTCTT GAAATGCATG 7260 

TGGATCATAA TCATCATGTT GCATATATGC TATGGCAAAA ACAGTTTGTG ACAATGATTy 7320 
CTTTTGAAAT GTTTGCCAAA ATTTTTGATT TAATGCCTGT ATCGACGCTT GAGATGTATC 7380

ACCTTCATTA GACACCAGGA CGTCTAATGC TGTACCGAAC TCTTCTGGTC TAAGTAAT CG 74 4 0 

CATATGTTCA GCAATCGTTC CAATATTAAC AAGTGGTTTA CCAACAATAA TTGCCTGAGG 7 50 0 

ATTTAACTGA GCACCATAAT ACAAGGCACC AATGAACCC ATAGATAAAC CAGATAATAT 7 5 60 

TAATTCATGT GATTTAAAAT TCAGCTTTTC TAATGTCTCG TCAATAACAT TGATAATACC 7 620 

TTGTTCATAT T C AG AT G AAC CGATATAAAA ACTACCACCT TCAACACGAG GATCGCCGAT 7 68 0 

AAGTAAAAAC GGTGCATTCA TACGTTTCAT CATATAATAT CCTTCGAAAC CTTCCGQ3T 77 4 0 

T C GAT AAC C A CTAAAATATA CGTTTAGTGG CGGTTTCATA TCACCAGGGT GGAAATAATA 7 800 

AATAAATTCC TGTCGTTGAC TATCTACGAA ACGACTACCA CCAAGTAAAA ATTGACCCAT 7 8 60 

GTCTAATCTA GACCATCGTT TGTGTATAGG TCCTAAATGT ACCGTCCCGT TCCCACGCGC 7 92 0 

CTTAACAGTT ACACTTATAT AAGCATCAAA TGGTTTCGCA GGTATCTCTA AAGGACTGTC 7 98 0 

TAACATATCA TCAGTCAATA CGATTTGTTC AATTAATGCA CCATCAGCGC CAGTCTGAAT 8 04 0 

CAATCTAAAT GTATATTGCA ACTCGACCGC ACCATCAATA TCAAATTCTG GCCATATTTG 8100 

AATGACTTTA TCTTTATCGT AAACGAGATT ATTTTGCCAA GATGCGATAG GTTTAAATTC 8160

TTTCCCAAAT TCTCCACTCA ATGTGAGCTC TGAATTACCT TGGTAAACGA CATCTCCTTT 8220 

AAAATTCGGA TGCACAAGTG CTAACTTAGG AGAAACCTTA TCTCCATACT GTCCTGAGAA 828 0 

GCTAACTGCC TCTAATTTAT TATTACGTTC TTCAATATTC CGGTAATGTA ATGGTTGAAC 8340

AACGTATTTT TGGACATTTT CGTCTTGTTC ATATTCAACT GACCAAAATG ATTCATCAAC 8 4 00 

ATACGTATTG TATGGTTCGC TTATCATTTG TAATAAATTC GTTAATGTCT CCGAGTATGG 8 4 60 

TGCTTGAATA TAGATAAAAT CAAAGCGCCC TTCTGCTTCA ACAATCGCTT CAATAGCCTC 8 52 0 

TACATAACCA CTATCAAATT G\AACAATCC AATATCGAAG TAATCCCAAC TCACACCTTT 8 58 0 

TTTGTGTTGA AAAATAGGTT CTAAATCGTC TCCTCCAATT TGCAAAACTC TAAATTTACG 8 64 0 

TGGCATCATT TTCACCTTCT ATTAACTCAT CGAGCTGATT AATAATATTC TTAGAAGCAT 8 7 00 

ATGCATCTAT TAATTTTAAA GAATAGGCGT ACGCATAATT CCAATTTTC AAATAAAATA 87 60 

AATAATAATT TAACGCATCA TCTAATTCAT CAACTGTATT TATAATACGG CCATTGTCAT 8 82 0 

AAT C AG AG AC GTAATCTGTT TGTTGACCAT TAATTTGTGG AATCCCAGCG CTAATTGCAC 8 8 80 

TAATTTGTAA ATACAAGTCA GGTTCTTTTG ACATATCTAT CACAAGTCGC AACGTCCGCA 8 94 0 

ATGCTTCTAC AACATCATGT TCAGCATGTA TCGTCTTAAC AGCAATGATG TCATCTTGAT 900 0 

CTTCAGGTGT CATTAATGCT GAAACATTAA CATCCGCATT CTGTTTAGCT TGGTATTCCT 9 0 60 

CATTTACCGA CGTAATACAT TCACGAAGCC ACATCGGTAT GTCATTTTGA TGGCGCGATA 9120 
ATAAAATTAA ACGGTAATAA TCTTCCTGTS CGATATAATC CACAAGTCGT TGCATCATTT 9180

GTTGCAAATC AGCGTCACTC ATACCATCTA TCCATACACC TATAAATGTT TCCATCAATT 924 0 

GACTACTTAT ATTAGGTGAT TGTCTCGTTT CAAATGGTGT GATTCGAATC ATTGTATTCT 9300 

CCCGCTGATA CTTCTCTTGA TGACTAATTA AATCACGTTC TAGTTGCACA CTAOAACAA 9360 

TTAAATGGGC ATTCTTTACG ATAGATTGAT ATTCCTCATC TGACACAGTT TCATTTCTAT 94 20 

TTTTAAAAAA TGAATAACTT AATGATTTCG CTGGAATATG ATTGGCTATT TGTCGATTGT 9480 

GCCTAGCATC TGAAGCCACA ATCACATGAT CATCTTCATG TATTTGTTGT GCAATCATTG 954 0 

CTTGAAATTT TTCTTCAATT AGTTGAGCCA TATTGTTATA TTCTGTTTGT T GAT AG T GAT 9 60 0 

GTTGATATCT TTTTGAAACA GTGACTCTGC CATTTTTCAA ATCTTCATGA AGTACACAAT 9 6 60 

CTCCATTAAT CGTTAAATAT TCTTGGTAAG AAGCCTCTCC CTGATCATCA AAATAACGTA 9720 

TCGCTGATAA ATAACCTCTG TCATCAAAAA TATAAGCCG TTGTAACTGA TCTCTTTCAA 97 8 0 

ATTCTTCAAA CCAAATTGAA TACCCTTCTT GACTAAAATA AATATTTGTA TAGGTCTGTT 98 4 0 

CACTCGTCAC ACATTTTAAT AAATACGGTG TGTACACAAA CTCAACATCA TCCGGCCATT 9900 

TTAAGTGATG ATAATTAATC GCTTGTGGCG CATGGTGACT GAATCCTTGA ATTTCATCAA 9960 

ACACAGACGA ATACTTTGTC TCATATAAGT CATATCGATG TAAAAATGTT CTTAAATTTG 10020 

GTGCATGATT GAGAACAATC AGTTGATAAT CTAAGTCATT TTCAAGGTGC ATTCCCATTA 1008 0 

AACTAATCAT ATCGTCAAAT TCCGTCTTAT TTTGTAGTTG ATAATACGGC ACAGTCGTGT 1014 0 

CTTGCCACCA TCGTTGGTCA TCGTACCAAG CTGGAATAAA GTATTTCATA ATTACCTCCT 10200 

TACCAATACT GGTTTAAAAA TGGCTTATAT TTATCAAAAT ATAAATATGT ACGAATTGTT 102 60 

TCTGCAATAT TAATACTGAT GTAAACTAAT ACAATCAGTT G TACT GAG a A ATAAATTTCA 10320 

G TAG AT AAA T GCGGTACAAA CAATGTGAAA TAAAGCGGTA TACAATAAT GACTGTAACT 1038 0 

AATGCCAATC CAAACCAACA TACGCGTCGT GCTTGATAAT TTAAATAACG TTCTGTATCC 104 4 0 

TTACCAGGTT TaACTCCTGA AAAATAATTG CCACTCTTTA AGAAATCTTT GGATTTTTGT 10500 

TTAGTATTGA TTAAAAATCT CGATAAAAAA TAACCCAATA ACATTTGAAT CACTAAATAT 105 60 

ACTGAAATAC CTACTGGACT ATCAAATGTC AGCAT TGGCA TGTCATCTGA TATGCTTTTA 10 620 

T T AAAC AT AG ATAAAATAAA ATGAATGCCA CTTTTTAAGA AAACAAAAGC TGAAATACTC 10680 

ATCATTAAAG TAATACTGCC TGCAGGGTTA ACTTTCCAAG ATAAATAAGA TTTCATATTT 10740 

GTTGCGGAAA CGTTCATTAA ATCGATATAT GGTATTCTCA CTTCTACTAA TTCAATAAAT 10 8 00 

AATAAGATAA ACAATGTGAT TATCACAAGG ATGATTAACA ACGCAATCAC AATATGACTT 108 60 

GCATCTATAT ATTCCATTTT TTGATGCATC ATTGATTTAA TAATACTAAC CATTACAATC 10920 

GGCATTGGTC CTGCGATGCC GTAGCGACTA TTTTTGTCAG CTAACCAAAC TAATAACATC 10980
GTTCCAGTAA CCAAAATCAA TATTGTTAAG TAAATATTGT CTTGATGAAC ACGTTCTTTC 11040

G AAAC AT AT T CATGAATCAC AAAATAACTT TGAATAACAC TTAAAATTAA TGTTAAGATG 11100 

CGCTCTTTAT AATGCTTCTC AAGACTTGTT TGTTTCATGT ATTTATCCAT ATTCCGATAA 11160 

GAAATCAGCA TCAAGATAAT CATTGATGTT AACCACGGAC CTAAtCCTAA AGTGAAAATG 11220 

TTTAAAGTAT TAACGTCTCC ACCCATATTA GAAATAGCTA TTTTAAAAAA TGACTCATGT 11280 

TTTACTTGCA TATCGTTaTA GGAAACGATG GAAATGTTTG TGCCTAATAT ATAAATAaAC 11340 

AAGATAAAAC ATGTGTATAG CATACGTTTA TiTATAATTT TATATTCGTA TTGTTGTAAA 114 00 

AGTTTTAACA TGTTGCACCT CTTTTATATC AAAAAC AT T A AAAAGACTAA GGGTTCATCA 114 60 

CTAATTATTA AAATCCTATA TCGATTTTTC TAGTGATTGG TGCCTCAGTC TTTTTAATTT 11520 

TAGCCAGCTA TAAATTCAAT TTATGCTTGA GAATCATCTT GATCATTTTC ATCTTTCIT 1158 0 

TTCTTTCTCT TCATTAAACC T AAAC C AAC T AATAATGTCA TAACGCCACC TAGTAATCCA 11640 

TTTTGTTTTA TTGAGTCACC TGTATCTGGC AATCTTTTTT CACTTTGTGC TGGTGTGCCA 11700 

TTATGTTTAG T C AC T T C AG A TGTTGCACTT AATGTAGACT GAGATTCACT CGTGCTCGTT 11760 

GTTGCTTCAC TTGATAAGCG AGATGTGCTC GTGCTGTGAG TATGATGCAT ACTCATTGAG 11820 

TCTGACGGAT GCATTGAGTT AGATTCAGAT GTACTTGTTG AGC CGGACAT ACTTGTTGAT 1188 0 

GTTGAGTCAG AAATGCTTTG T GAAC C AG AC ATAGATGTAC TCAGTGATTC GGATGTGCTT 11940 

GTCGAATCGG ATGTGCTCAA TGACGTTGAT GTGCTTGTTG ACACTGATTC TGAGTCACTA 12000 

ATTGATGTTG AGTCGGATTT GTCTTGTGAC ATTGAAACAC TCGATGAATT AGATTCACTC 12060 

ATTGATGTTG AGTCAGATAC GCTCGTTGAA CCTGAACCAG ACGTACTTAA TGATTCAGAT 12120 

ATGCTTGTTG AAGTTGAACC ACTTGTTGAG TCCGATGTAC TTGTCGATGT CGAGTCTGAA 12180

TCTGATGTAC TCAATGATTC TGAGTCACTG ATAGAAGTTG AATCACTTGT AGATTCTGAT 1224 0 

TCTACTGTAC TTTGTGAACC ACTGATACTT ATTGAAGTAG AATCACTGAT ACTGTCTGAT 12 300 

GTTGATAATG ATGTCGACAC CGATGTGCTT TGTGATGACG ATGTAC TAGC AC T CAT T G AC 12360 

ATTGATGTTG ATATCGATGT ACTTAAGGAA CCAGATGCAC TTGTACTTGT TGACTGGCTT 124 20 

TGTGACATTG AATCACTTAA T GAT G TAG AT GTGCTTGTTG AGCTCGAGTC ACTTACACTT 124 80 

GTTGAACCTG AT AT T GAG T C ACTTAAACTT GTCGATGTTG AAAC T G At wC GcTTCCGCTC 12540 

ATTGAGTCAG ATGTTGAAAG TGATGTACTC GTTGAATTTG ATCCACGAT GCTAGACGAA 12600 

TCACTTGTAG ACATTGAGTC GCTTTCTGAT GCACTGATGC TCATAGAGTC AAATTGACTA 12 660 

TTACTTGTTG AGCTTGACTG CGAATCGCTC ACACTTGTTG ACGTTGATTC TGATCCACTC 12720 

ATACTTTGCG AGCTACTCAA TGATTTTGAA TCACTTAATG AATCCGAAGT GCTAAGACTT 127 8 0 
GTGGAACCAC TTAAAGATAT TGATCCACTT AATGAGTCGG AGTCACTTGT ACTAGTAGAA 12840

TCACTCATTG ATATTGAATC ACTTAGcGAG GTAGACTyGc tTACGCTTTC TGAACCACTT 12900

AATGATGTTG AGGTACTCAA TGAACCAGAT GTACTTGTTG AAGTCGAACC ACTTGTTGAT 12960

TTTGAATCAC TTAATGAATC AGATTCACX ACGCTTTCTG AACTTCTTAG TGACGTCGAT 13020

ACACTTAATG ATGACGAATC GCTTGTGCTT ACTGAATCG 13059
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

AGGGATGGCC TTACCTAAAA AACCGGGnAA ACCCTCCAAA ACCCATTAAA AGGnTGGnTA 60 

CCCTTTAAAA TGGTAGCATT TAACCGCCAC CCGCCAAGGT GGGTGGTTffi TTCTTCCGTT 120 

ATTTAAATTA GTACACCATG CAGATTCTGT AGTTGAGGGA TATTTTAACG AAAGCTTATT 180 

AGCAACTGAT AAAAAAATAC GTCCTAAGGC ATATATTGCT TCATGGAAGG ACATCGAGCC 240 

GGCTAAGAAA ATAGAATTTA AAATTAAAAA AGGTATTAAA TGGCATGATG GTAATGAATT 300 

GAAAATTGAT GATTGGATTT ATTCAATTGA AGTCTTAGCT AACAAGGACT ACGAAGGTGC 3 60 

TTATTATCCA AG T G TAG AAA ATATCCAAGG TGCGAAAGAT TATCATGAAG GAAAAACTGA 4 20 

TCATATTAGC GGATTGAAGA AAAT AG AT G A CTACACTATG CAGGTTACAT TTGATAAAAA 4 80 

ACAAGAAAAT TACTTAACAG GATTTATTAC TGGACCTTTA TTAAGTAAAA AATATTTATC 540 

AG AT G T AC C A ATTAAAGATT TAGCGAAATC AGATAAAATC CGAAAAT AT C CTATTGGTAT 600 

TGGACCGTAT AAAGTTAAGA AAATCGTTCC AGGTGAGGCT GTTCAACTCG TTAAATTTGA 6 60 

TGATTATTGG CAAGGTAAGC CTGCACTAGA CAAAATCAAT TTAAAAGTTA TTGATSAGC 720 

GCAAATTATT AAGGCAATGG AAAAAGGCGA TATTGATGTT GCGAATGATG CTACCGGTGC 7 80 

AATGGCAAAA GATGCTAAGT CATCTAATGC TGGTCTCAAG GTATTATCTG CGCCAAGCTT 8 40 

AGACTACGGT TTAATAGGtT CGTATCTCAT GATTACGATA AAAAAGCTAA TAAAACTGGT 900 

AAAGTGAGAC CAAAATATGA AGACAAAGAA TTACGTAAAG CAATGCTTTA TGCAATTGAT 960 

AGAGAAaAAT GGATCAAAGC GTTTTTCAAT GGTTACGCTA GTGAAATCaA TAGTTTTGTA 1020 

CCATCTATGC ATTGGATAGC AGCCAATCCT AAGGACCTAA ATGATTACAA ATATGATCCT 1080 

GAAAAAGCTA AAAAAATCTT AGATAAGTTA GGTTATAAG ATAGAGATGG TGACGGATTT 114 0 

AGAGAAGATC CTAAAGGTAA T AAAT T T GAG ATTAACTTTA AACATAATTC AGGTTCTAAT 12 00 
CCTACTTTTG AACCAAGAAC TGCTGCGATA AAAGATTTCT GGGAAAAAGT TGGCTTGAAA 1260

ACAAATGTGA AGTTAGTAGA ATTCGGTAAA TATAAtGAAG AC T TAG C AAA ACATGCATCT 132 0 
AATACGATTC CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC 1380 
GACAGACCTT TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA 14 4 0 

GAAACAGAGC GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT 1500 

GTTTATCAAT TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT 1560 
CTTCGCGTCT TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT 1620 
ACTAAAGATT CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 168 0 

ACGTACATTG AGTCATTATT AAAAGAACAC AAATTGTTAT AACGAAACC ATTAATAGAT 174 0 
TTTTATTTGG TGATTTCAAA T CAT GAG ACT GGGACAGAAA TGATGTTTTC ATAAAAATTA 18 00 

TTTCGTTGTT CCACTCTCAT GATTTTTTTG AT G AAAC AT A ATTACATGAT TGATTGCATC 1860 
ATTTTGTTAA ACAAGTGATT GCAAACCTGC CATTTCACAC TGAAAATTTA CATAATAAGT 192 0 

GACGATATTT TACAAGTCAT ATACAAATAA CATATATTGT TAAATAATTT TACCTAATCT 1980 

T AACAT T AAA TTTACAATTA TAAGCGATAA TCTAAATATA AAGCTTATTT GAGGTGAAAT 204 0 

AATGGAAATG TCGGTTACAG AAGTCATTTT CTCCTTTTTA GGTGGTTTAG GTATTTTCCT 2100 

TTACGGCTTA AAAATCATGG GAGACGCECT TCAAGCATCA GCAGGAGACA GGCTACGAGA 2160 

TATTTTAAAC AAATTTACAT CAAATCCAGT ATTAGGTGTT ATTGCAGGTA TCGTTGTAAC 222 0 

TATTTTAATA CAAAGTAGTT CAGGTACGAC AG T TAT C AC A ATCGGACTGG TAACAGCTGG 22 80 

ATTTATGACA TTGAAACAAG CCATTGGAGT GATAATGGGT GCTAATATCG GACAACGGT 234 0 

AACTGCATTT ATTATCGGTA TAGATTTAGG CGAATATGCA ATGCCAATTT TAGCATTAGG 2 4 00 

TGCATTCTTA ATCTTTTTCT TTAAACGCTC TAAAATCAAT AACATTGGCC GCATAC TATT 24 60 

CGGTTTCGGT TCACTATTCT TCGGTCTAGA ATTTATGGGT GATGCCGTTA AACCTTTAGC 2520 

AT CAT TAG AT GGATTTAAGC AATTAATGCT TGATATGTCT AC AAA T C C AA TACTCGCTGT 2580 

CATTGTCGGC GCAGGGTTAA CAGCACTAGT TCAAAGTTCA AGTGCGACGA TTGGTATTTT 2 64 0 

ACAAGAATTT TATCAACAAG ATTTAATTAG CTTAAACGCA GCAATCCCTG TGTTACTAGG 2 7 00 

CGATAACATT GGTACCACGA TTACAGCTAT CTTiGCTAGT TTAGCCGGCT CAATCGCTGC 2 7 60 

AAAACGTGCG GCGCTTGTAC ACGTCATCTT TAACTTAATC GGGGTAATTA TCTTCACAAT 282 0 

TTTCTTGCCA GTTGTGATTC ATTTGATTAG TTTGTTACAA GATTTATGGC ACTTAAAACC 2880 

AGCGATGACG ATT GC AG TAT CACATGGTAT CTTCAACATA ACAAATACTT TGATTCAATT 2 94 0 

ACCATTTGTA GCAGGTTTAG CATGGATTGT TACAAAGCTT GTCCCAGGTA AAGATATTGC 3000 
TGATGACTAT AAACCTCAGC ACTTAAACAA AGATCTTGTT TATCACGCAC CTGGTGTTGC 3060

AT T ACAAGAA AC T C AAAAAG AATTACAAAA TGTGGGTCAG ATTGTCTTAT CAATGTTTGA 3120 

AGACATTCGC GAAATTACAA AAGACGATAA AAAATTGATC AAAAAGCTTG AACAAAAGCA 318 0 
TCAAGCTGTT GAAACAATCA ATGATAGCAT TCGAAATTAT TTAGTTAGAA TTTCTACAAA 32 4 0 

AGCCATTACG AAGGCAGACG TTGAGCGTTT AGCAGTTATG TTTGATGTCA ATCGCTCTAT 3300 

TTTAAAAGTA GCAGAGCTAA C AG AAG AG T A TGTCGCTCAA TAAAACGCC AAC AT GAT G A 33 60 
AGATATTCGC AT T AC AGAAG ATGCACAACG CGGTATGGAT AAATTATTCA ACCATGTTGC 34 20 

TGAGTCATTT GATAAAGCCA TCGACATGTT AGATGTTTAT G AC AAAAC G A AAAAAGATGA 34 8 0 

AATTGTAGAA CGTAGTAGAG AATCATTTAA TATTGAACAT AAACTACGCA AAGGTCATAT 3540
TAAACGCCTT AATCGTGGTG AATGTACAAC AAAAGGCGGA TTACTATATA TCGATATGAT 3 600 

TGGTGTTCTT GAACGTATCG GTTATCATTC ACGAAATGTT TCTGAAGCAC TTGTTGGCCT 3660 
TAACGATGAT GTACCTACAG AT G AAG AAA T TGCAACAACT GAAATTTAAT TTTTACTGTC 37 20 

TTATTTATAT TCATATTTTT TT2AAATTAG AGATTCAGAT GCATGTAAAA AGCCAATCCA 3780 
ACATTCATGG GTTGGCTTTT TTGTTTAGCA AAATTTATTA TCTTAAATCG GCTATAAACA 38 4 0 

CTGATATAAT AATGCTTCAT TAGTATGCGG TAAGCATGAC GGACACTGTT CTCGGAGTCT 3 900 

GACCCCGAAA CGTTTAATAT ACACTTTTAC ACGTCGCCTT CATTGAAGG AATTGCCATA 3960 

ACCTTCACAT TATATATAGT TCTTTCCATA TAAATGTCCA AATTTTTAGA ACAACGCAAT 4 02 0 

AAATAACCAT CCACCTAACT TATCAAAAAT TTAAGTGGAT GGTTTTTCAT TTTCATTTAT 4 080 

ATTTATATTA GTGTTAATCC AATCATAGAT TTATCTATAT GCACTGCTCT ATACATTTCC 4140 

TCATTTAATT TGCTTTACTT TCATTTATAT CAT TAT C AAA ACACTTGGCG TGTCATCGTT 4 2 00 

ATTATTTCGC ATCTTTGACA CGTTTATCAT CATTAGGAAT CGCGAATAAA ATTGCGATAA 4 2 60 

ATGCCATGAT TCCCATTAAT ACGTTAACCC AAAGTGCAAT CATCGCACCT GTATGAATGC 4320 

TCGTTGCAGC AACTGCACCA GCATATACAG CACCACTAAT TGCGACACCG AATGCGCCAC 4380 

C AAG T GAT G A AGCCATTTTA TAAATACCTG AAGCAACGCC AACTTTATCT AACGGTGCAT 44 40 

TCGAAATAGC TGTATCTGTA GAAGGTGTTG CATAAATACC TAAGCCTAGT CCGAAACATA 4 500 

AATATCCTAC GACACAACTG ATAACATAAA ATATGCCTGG T AAGAAT AC T AATGA&TAA 4 5 60 

GTGCAATACC AAT G AC C AC A ATGAATGTAC CTAATAACAT TGGTCGCTTA GAACCCATTT 4 620 

TTTGTAATAA TTTTTCACCA AC T C G AAT C A TCAATAACAC CATGATTAAA TAAGTAATTG 4 680 

ATAAGTATCC TGCCTGCAAT G C T G TAT AAC CTAAACCTTG TTGCACGAAT GTATTCGCTA 47 40 

CAATTAATGT ACCTGCAAAA CCGTTTAATA AGAAGTTCGA AATCGTTGCA CCTGTATATG 4 8 00 

GTTTATTTTC AAATAATTTA AAATCAATAA GTGGATTATC TACTTTTTTC T C AAC AT T T A 4 8 60 
AGAAAATAAA GAATGCTACA ATTACGATTG CAATCAAACC AAAGAACCAT AATGATGTGT 4920

AACCAAGTGC TGCACCTTTA GTAATGACAA CGTTTAA&T TAGCAACATA ACTACTAGAA 4 980 

CAATTAGCCC TGCAACGTCA AATTTATGTG TATTGGTAAT TTCTGATTTC GTTTCAGGCG 504 0 

TCCCTTTGAT GAGTAACATT GAAAGTACGG CAACGATAAT TGAGAAGATG AAAAT C CAT C 5100 

TCCAACCCAT AGTTGTCGCA ACTGCACCAC CGAAGAGTGA ACAGATACCA CTGCCACCCC 5160 

AAGAACCGAT AGACCAATAA CTTAAGGCAC GCTGACGTTC AGCACCCTGA TAATAAGTTT 5220

TCATAATGGC CAATGTAGAA GGCATAATAC ACGCTGCTGA TACACCTTGT ATAACACGAC 52 80 

CTAAAATTAA TAATGCCGGT AAATTCGTAA TAATAATTAA TGCTGAACCA ATAATACTTA 53 4 0 

ATAATAAACC GATATTCGTC ATTTTCACGC GCCCAATTTT ATCTGCCAGA CCACCTGCTC 5400 

CAACAACAAA CATGCCTGAA AATAGTGCAG TTAGACTGAC CGCAATACTA ATTGTCCCCA 54 60 

TGTCTGTACC AAAACTTTGT TGTAAATTCG GTACAACATT TACAAGTGAT TGTGCAAACA 5520 

ACCAAAATGT AATAACACCT AATACAATAC CTAAGATTAA CTTGTGCCC CCGCGATACG 558 0 

TTTCATTCAT GTTAGTTATC TCCTTTAAGG TAATCTAAAA CAACTGTCCC TACTGCTTCT 5640 

GCAGAAATAA GTAATGATTT TTCTGAAATG TTAAATTTAG GAT GAT GAT G TGGGTAAATT 57 00 

TCACCATTTT CCACCGCTGC ACCTGTATAA ATAAAGGCAC TTGGGCGTTC TTTAGCATAA 57 60 

TATGCAAAGT CTTCTGAAGG TGGTTGTGGT TCACACATTT C AAC AC C AAA ATCAAGGTTT 58 2 0 

GCTTCTTTCA ACGTCTTAGC CACGTACTCA GTAAACTCTG GATCATTATA TAATGCTGGA 58 8 0 

TAATCATCGT TATATTCTAA GGTGCAAGTt ACACCATACA TATCCTCTAA TCCTTTTGAT 5 94 0 

AAACGTTTAA TTTCTTTTTC AATTGT1SCT TTTGTAGCAT CTGTTAATCC ACGTACATCA 6000 

CCTTCAATTT CAACAACATC TTTAATGACA TTGAATTGAC CTTTACCGTC AAATGAACCG 60 60 

ATTGTGACAA CACCGGTTTC AAATGGACTT AGTCGTCTAG ATACAACTGT TTGTAACGCT 6120 

GTGACGAAGT AGCTACCTGC AACAATGGCA TCATTGGCCA TATGTGGTGA TSACCATGA 618 0 

CCACCTTTAC CTTGAACTTT CAATTTGAAG AATGCGCGTC CTGTTTGAAC ATAACCAGGT 62 4 0 

CTGTAATACA CTTTACCTGT TTTCATTGTG CTCATGACGT GTACACCTAA T AC AT GAT C A 63 00 

ACACCGTCTA ATACACCATT TTCAATCATT GTTTTAGCAC CACCTGGTGG TACTTCTTCA 63 60 

GCTGGTTGAT GTATCACAAC GACTTTTCCT GTAAAACTAT CTTTCATTTC AGCAAGCGTC 64 20 

TCTGCTAATA CAAGCATGTA TGCTGTATGT GCATCGTGAC CACATGCGTG CATAACACCT 64 8 0 

TTATTTTGTG ATGCAAAAGA TAATCCTGTA TCTTCAGTAA TGGGTAATGC GTCAAAGTCT 65 4 0 

GCACGGATTG CTAATGTTTT ACCAGGTTTC CCTtAATCAA TCGTTACTTT AATTCCACGT 6600 

GGTCCGACAT TCGTTTCTAC TTCCACATCT TTACCTTTGT AAAATTCAGC GATGTATTTC 6 6 60 
GCCGTTTCAT CTTCATGAAA AGATAATTCT GGATGCTGAT GTAAATAACG TCTGATCTCA 6720

ATCATTTTGC CTTCTTTAGA TTTTAAAGTT TCAATTAATT GTTGATTCAT ATCCTTCATC 6780 

TCCTTAGTTA CATCATAAAT GATTAATCAT TATTTATATT GCCAACAACA GAGATGTTAA 68 4 0 

CCATTAATTT TTTGCAATTT TAGCTTTGAA TATAAAAAAT CACAAATTAT GTATATCAAA 6900 

ATTTGTGATT TGTGATCATT TTATGAACTT GGGTAACGTT TTACTTCAAT TAAGTGAATC 69 60 

CCATTCGTAA TCATTTTAAT GTTTAATGCC AGTGTGTCCG TGATATCTAT ATCATATACT 7 020 

TCTAATTTCG GAAAACTCAT TCGATTAACG TAATCTATAG AGTCCTTGTC CATGCCATGT 7 080 

ATCGTATGAT GTTTGCGCCA AAGATTAAAT AACGCACCAT TTTCTTTATC TAAGGTAAAA 7140 

TGTTTAATCT TATACATACC TTCTTCCAGG GCATTAATGT CAAATGAAT CATTTCCGTC 7200 

GCACGCATAT TCATTTGATT GTCCAACGCT AAGTACGGAT TAAAATGCTT TGCATCATAT 7 2 60 

AACAATATTT GAAAATTTGA ATCAGTCCCC GTGACAATAC ATGTATCATC AGAATACAAA 7320 

ATATTGCTTG TTAATTTATT AAATAGCAAT GCCGTGAAAT AGACCGGACG TTTTCCATTA 7380

TATTGATGAA ATAGTTCAAT AGAATTCATA TAATCCCGTT CATTTTTACA ATGACTGACG 74 4 0 

TGCAAATCAT AATTCAACCA ATACCCGATA CCCTCTACTT TAGAACTTAA TTTTAATAAT 7 5 00 

TGCTCAATGA TGATACCACC TCTAAAATAT TCGCCGTTTG TAATAAATGT ATCACCCGTC 7 5 60 

AATGTATTCC AATTGAGTAA AATGAGTGGA CGCTTTAGGC GATGACGATG CATTAAGTCG 7 620 

ATAAGGTAAT TCGTTTTATT AATAATCATT TGACTCGCGG TTTTAAATTC ATCATCATTC 7680 

ATTTTATTAA AATCAACAGC GTCATTTGAA TTGGCATCAA AT AC AAAAT G GTCGATGTGT 77 4 0 

GGCTCAAGTC GTTTCAATAA TGGTAGATGT CTTTCCGTAG CTTGATCTA GTGAATGTAC 7 80 0 

AAGCCACCAT TAGGGAATAA TGCTTTAAAA TAATCAATCA TTTCAATCAA AGACGTGTGC 7 8 60 

AATGTCGTCA CATACAAGTT GAACTTCAAA TCTTTTCTAT G AC T G AC AT G CAGGGCAACG 7 920 

TGATGGATAA AAATTTTAAA TGCATCGATA TAATCACGTG AG T CAT AC T G ATCCAAATGC 7980 

ATGGTCAAAC TAAAGTTATG ATCTAATAAA AAGTCTAAAC ACAAATCAAT ATCAT AAAAT 8 04 0 

AT AT T C G AAA TTTCTGCATC ATACGTGAAT GGCGCATTGA GCTTTTTCAT GATATATGGA 8100 

ATCACATCAT ATGCTAATAC TTCATTGACT T G AAAAT CAT GATGACATGT AAGCAACTGT 8160 

GATTGATACT GTGTATTGAG CAAATTCCTC AAATAGCCCA CTTGAATAAT ATGATTAAAT 8220 

TGATTTAGTT GGTGATTGGT TGGTTGAAAG GCAATCTCTT TATAGTTCAT CTTTTCAATA 8280 

TCTTCAATAA AAT GAT T CAT TTCTTCAATG TAGTCATTTA AAAGTAATAT CAATTCACGG 834 0 

TCGTGATAAT CATGTTGTGC CGATTGCTGG TTTTCAGTGA TTGCTGGACG ATCACTCGA 8 4 00 

TATTGTTTAG GTGTTTGATG CGTAAAGTGT TTAAATGTTC TCGCAAAGCT CGCTGCACTT 8 4 60 

GAAAAGCCAT TTTGCAATGC AATATCAGTA ATGGTTTCAT CTGTGTATGT CAAATCGAGT 8 520 
TTCGCATGCT CAATTCGCGT CGTATTTAAG AAATGATGGA ATCCTACACC TAGCGATTCT 8580

GTAAACTTTT TAGACAGATG GCTCTCTGAC CACCCAACGT ATTCGCTTAA TTCTGAAAGG 8 64 0 

CTTAAATCTT CAT G AAAAT G TAACTCGATA TAGTCGCATA CTTGATTCAC TTTATCATCA 8 7 00 

TTTAAGATAC TTTGGTTCGA ATGATATGTA cGCGGGACAT AATGAATCAT ATGCATAAGC 8 7 60 

AACTGAATCA CAAGTTGTTG CTCAGTCAAT TTAGACAET CATTATGTCG GATATGTGTT 8820 

GAAACCAGTC TTGCCATTAT ATTTCTCAGT TGATGTATAT TCTTTGTTGT GGTCGCATCT 8 8 80 

GTTAAGTGAA AATATAGACA ATGCACATCA TCAAACTTGT CTGCTAAATA TTTCATTTGG 8 94 0 

AATTGGATAT AACATATGAT GCCATCTTGT TGAAGTTGAA AT C GAT AC AA GTCGCGGTGG 9000 

TTAATGATGA AAATGTCGCC ACTGTTGCAT TGCGTCATAT TATTTTCATC ATAAATGTGT 9 0 60 

GCCTCnCCTT TAATAACAAA ACCAATCATT AAACTATTGA GCCTTTTGAA AT C T GAC AT A 9120 

CTCTCAGTTT CTACTCGAAT TAAATAATCA CGTTGCATAC TATCCCTCAA TTCAGTAATA 9180 

TGAATACGTT TATTTTACAT TATTTTACAG CAACATATTT GAATTTCATA TTGAATCGTG 92 4 0 

TGTGTGGATG ATTATTTATC CTCACTCGGT TCAAGATGTA GACTATCAGT AAAAAAAGTA 9300 

TTTTCACCTT TTTTCTCCAC AAAAGTAAAT TCAATGTCTT TATATCCAAC TGrTGaACCT 9360 

TTTAAGTCTC CCgAACCTTT CaACaATAAC TTTGGTGCTT TATTGTTGG TATTTTATAT 94 20 

CTTTTTCGTA ATTGTTTTAC ATTATAGTCA TCATTAGTTA ATTGATATTT TGCTGAATAA 94 8 0 

CTCGGTACCT CTGGATTATA TGATATATCG CCGTCTTTGT ACTTCGACAA ATCTTTAAAG 954 0 

CTGCCATATT GCGCGAAGAA CTTAAAATTC TCGATTTCTT TTTTTATATT TTCGTCTTTG 9 60 0 

ATACCTTTAG TTGGAATGAT TTTATTGTCT ACCATTTTAA CGGGATATTC TTTATCTTTA 9 660 

CTCTTAGGTC TACCATCTTC ATCATGAAGT GTTTCACTCA CTATATACTT CCCGGTTGTA 972 0 

GTCTTAGTGT TTCTATTCAT ATATAGAACC ATACCTTTTG ATT T CAT AC G TTCCCCTTTA 978 0 

GGTTGAACAA CCATTTCAGA ACCAATJATC CATGTACCTT TATCATTTTT ATCAAATTCG 984 0 

T CAT C AC GAT AACCTTCTTT ATCGTATAAA TCCTCTAGAT TTTTAATCGG ATACATACTC 9 900 

AATGTTTTTT CAAAGCTTTT CTTAACTTCC GCTTCTTTAC CTATGCCACA ACCAGCAGTG 9 9 60 

AAACTAATGA CTAATATCAA AAAACTAATA TACAATACCA ATTTGTTTAA TGTTTCATA 10020 

ATTTCACAAT CCTATTCTTC TTATTATCTT TCCTGGATTG ATTTCATATT TTGATCGAGT 10080 

CATGATTATT TATCCTCACT TGGTTTAAAA ATTAACCCAT CACTAAAGTA AATGTTCTCT 1014 0 

TCTTTTTTCT CTACAAACGT AAATTCAATG TCTTTATATC CAACTGATGA ACCTTTTAAA 10200 

TTCCCTGTAC CyTTCAACAA CArCTTCGGy GCTTTATTTG TTGGTATGTC ATATCTTTTA 102 60 

CGTAATTGTT TTACATTATA ATCATCATTA GTTAATTGAT ATTGAGCTGA ATAACTCGGC 10320 
ACCTCTGGAT TATATGATAT ATCTCCATCT TTATAATTCA TTAAATCTTT AAAATTGCTA 10380

TATTGCGCAA AAAACTTAAA GTTTTCGATT TCTTTTTTA TGTtTTCTTC TTTAACTTCC 104 4 0 

T C AG TAG AAA TGAATTTATT ATTAATCATT TTAACTGGAT ATTTTTTTTG ATTATCCTGA 10500 

GCTACTTCGT ATTTCTCCGT CTTTAtTTCA TTAGTATAGT AAAAtCCTTT TGCACTTCTT 10560 

GTATTTCTAT CTATCTTCAA AAGCATGCCT TTTATTTTTA GAGCTTCTCC TTTATTTTGA 10620 

ATTGCCATTT GAGAATTTAC AATCCATGTT CCCTTATCAT TTTTATCAAA TTGATCATCA 10680 

CGATATCCTT CTTTATCGTA TAAATCCTCT AGATTTTTAA TCGGATACAT ACTCAATGTT 10740 

TTTTCAAAAC TTTTCTTT 10758 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

nTaTCaACTT TCGaATTTAA AgTCAATAAC TTTTTTAAAA ACTTTTTGTG TTCACAACCC 60 

GCTTCTTTTT CAACGCGTTT ATTGCTTAAC ACAAGAACTT ATTTTACCAG CATTCCAAAA 120 

CAAATCAACA TAAAAACGTA CAAAATAAAA GTAATTTTGT ACGTTTAGCA TATATTATAC 180

CTATTTATTT GTAGCAGCTA TAACTTTTTG TGCAATGAG CTATAAATTT TACCTAGACG 240

ATCATCTGAT TGATATATTG ACGGTGCAAA ATCTTTTGGA TTCCAAGATG GTTGCTCTAA 300

AGGTAATTCC CCAAGTAATT GAGTATTAAG TTCATCAGCT AACTTAGTAC CGCCACCTTT 360

GCCAAAGACA TATTCTTTAT TACCCGTCTC TTTACTTTCA AAATAACTCA TGTTTTCAAT 420

TACGCCAAGA ATAGAATGAT CCGTATGTTT TGCCATCGCA CCTGCGCGAg CTGCAACAAA 480

TGCTGCTGTA GGATGAGGTG TCGTTACAAT AATTTCCTTA CTTGAAGGTA ACATCGTATG 540

AACATCTAAA GCTACATCTC CTGTTCCAGG TGGAAGATCG AGTATTAAAT ATTCAATGTC 600

TCCCCATTTA ACTTCTGTAA AGAAATTCGT CAACATTTTA CCTAACATTG GCCCTCTCCA 660

TATAACTGGC GCATTTTCTT CCACAAAAAA GGCCATTGAT ATAACTTTAA CGCCATGACG 720

TTCAACTGGA ATTACTTCCT TCCCTTTAAT TCCAGGCTTT TCATCAATAC CCATCATATC 780

TGGTACACTA AATCCATATA TATCGGCATC TACTAATCCG ACTTTTTCC CTTCACGAGC 840

TAAGGCAACG GCTAAATTTA CTGCAACAGT AGATTTACCG ACACCACCTT TACCGGAGGC 900

TATTGCAATA AATTCGACTG GATTATTTTG AGATAATAGG CCTTCTATCG TTTTAGGTTT 960

ATTTTCTTCT TTTGGTTTAA ATTGATTTAC TTTTTCTTCC GGCAATGTTT CAAATCGTAT 1020
ACCGACCGTT TTCGCACCGT TTTCTTTTAA TGCATTAACA ACAGCCATCT GTAAATCTAA 1080

aTTGCGtGCA CCACCTAATT GTGCCATTG 1109



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3997 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

TCTTTATTTA AAAAAATGAT TGTCTAGTTT GTATCTCTCT GAAGATTTGG CAATAAATAA 60 

AAGCCGATAA CCGTATAATG ATTATCGACT TAAAGTTTAT GTGGCATTTT TTACTTTTGT 120 

AATTTCAGGT GAGTTAGATG AT TAT TAT C A GATAGATTAT TGCTTATAAT CATATGATGT 180 

TTGAATGATA TCTTTGATTT C AC T GAT TAG TGCTTCTTTA GGATTAGCAG TTGTACATTG 24 0 

ATCTTCAAAT GCGAGCTCTG CCATTCTATC AATTGACTCA TTTAATETT C T T C AG AC AC 30 0 

ACCTTGTGAT TTCAAATTCA TTTCAATTCC GACTGATTGA CCTAATTCGT AGACAGCTTT 3 60 

AgCTAATGAT TCTACGAGTG CTTCTGTCGT ATTACCTTTT AATCCTAAGA ATTTGGCAAT 4 20 

ATCTGCATAA TCTGTATCTG CTCTGAAGAA CTCATATTTA GGGAATAATG CATGTTTTTG 4 80 

CGGGTCTTTG GCATTATAAC GGATAATATG CGGTAGTAAT ATCGCATTCG CTCTACCATG 54 0 

CGGAATACCA TATTCGCCAC CAATTTTATG CGCAATTGAG TGTGCAATGC CTAAGAATGC 60 0 

ATTTGCAAAT GCCATACCAG CCAAAGTTGA TGCGTTATGC ATTTTCTCTC TTGAAACTTT 6 60 

ATCACCCTTT TCAACAGATG ATTTTAAATA TTCGAACGTC AATTTAATCG CTTGTAGACT 720 

CAAACCTCTT GTGTAGTCTG AAGCC AT T AC AGATACATAT GATTCCATTG CATGCGTTAG 7 80 

TACATCCATT CCTGTATCTG CTGTAACGCT TTTTGGCACA C T CAT C AC AA ATTGAGGGTC 840 

AATAATTGCA ACGTCAGGTG TTAAAGCAAA ATCAGCCAAC GGATATTTTA CATTGTTTC 900 

ACTATCTGTG ATAACTGCAA ATGGTGTTAC TTCTGAACCT GTACCTGATG TCGTAGGGAT 9 60 

ACAAATGAAC GTCGCATTTT CAGGCATGCC TATTTTATAA GTACGTTTAC CGATGTCTAG 1020 

GAACTTTTGT TTAGCACCGA AGAATGATGT CTCAGGGTGT TCAAAGAACA TCCACATTGC 1080 

TTTTGCAGCA TCCATCGCTG AACCACCACC AAGTGCAATG ATTGTATCCG GTTGGAAATC 114 0 

AACCATCATT TCCAGACCTT TATATACTGT ATTAGTTGAT GGGTTCGGTT CGACTTCGCT 1200 

AAATATTTTT ATTTGAGGCT GTTCCGTTTT TTGTCTTAAT ACATTCTCAA CTGTTTTTGT 12 60 

ATAACCGAAT TCTACCATAC CAGGGTCACA GACAATJATC ACTTTTTCAA TCTTGTCCAT 1320 
TGTTGTTAGA CTCATGATTG CATTTTCTTC AAAATAAATT TGAGCAGGCA CCTTGAAAAT 1380

T T GAG TAT T A TTACGTCGTT TAGCAATCGT TTTAATGTTT AATAAATCTG TCGCACTAAC 14 4 0 

ATTATGTGAA ATTGAGTTTC TACCGTAGaA CCACAACCTA ATGTTAAAGA CGGAATCAAT 1500 

TCGTTATACA TATCACCAAT ACCTCCAACC GCTGATGGTG TATTTACAAG TACACGACAA 15 60 

GCTTTCATTC TTAGTCCAAA ATCTTTTTGT AATGTTTCAT CTTCTGTATG GATAACGGCT 162 0 

GTGTGTCCTA ATCCACCAAA ATGTAGTGTG TCTTCACAAA TTTGAAATGC TTGTTTTGTA 168 0 

GATTGGGCTT TTACTAAG3C TAATACTGGA GATAATTTTT C AC G AG AT AA CGGATAGTCT 174 0 

G AAC C T AC AC CGCTAATTTC GGCTATGATA AGTTTTGTAT TTTCGGGGAC AGGTATACCT 1800 

GCTAATTCAG CTATTTCAAC TGCAGATTTA CCGACAATAT CAGGCTTAAT ACCTGTTTTT 1860 

TGTTCATTcA TAATTGCATT TTCTAAGCGT TGTAATTCAT CTTTTTAAC AAAGTATGCT 192 0 

TGATGTGCTT TAAATTCATT AGTAACATCT TTATAAATTT CTTTATCAAT GACTACAACT 1980 

TGTTCAGAAG CACAAATCAT ACCATTATCA AATGTTTTTG AACCAATGAT ATCATTTACT 2 04 0 

GCACGTTTAA TGTGTGCTGT TTTTTCAATG TAAGACGGCA CGTTACCTGG TCCCACACCT 2100 

AATGCCGGTT TGCCAGTTGA ATATGCAGAC TTAACCATGC CCGAACCACC TGTTGCTAGA 2160 

ACTAATGCAA TACCTTTGTG ATTCATTAAT TGTTTTGTTG CTTCGATAGA AGGCACTTCA 2 22 0 

ATCCACTGAA TAATATCTTT AGGTGCACCT GCCTTCATTG CCGCTTCTAA T AC AAC T T C T 228 0 

GCTGCACGCT TCGACGATTC TTGTGCACTT GGATGGAATG CAAAAATGAT TGGATTTCCT 234 0 

GTCTTAATTG CAATCATCGC TTTAAAAATA GTTGTCGACG TAGGATTTGT TGTTGGCGTA 2 4 00 

ACACCACAAA TAACACCAAT TGGTTCCGCT ACATACGTTA ATCCTTTTTC TTTATCTTCA 2 4 60 

CCAATAATCC CTACTGTCTT ATTGTCTTTT ATTGAATTCC ATATATATTCAGAAGCGTAT 2520 

AAATTTTTAA TCGCTTTATC TTCGTATATA CCTCTTCCAG TTTCTTCATG TGCTAATTTT 2580 

GCTAGCACCA TATGTTGATC AACAGCTGCT AAGCTCaTTT GATGAACAAT ATGATCAATT 2 64 0 

TCTTCTTGTG ACTTTTTAGA TAATGCTTCT AATGCTTTTT TCCCTTTGTC AGCTAGAGCA 2700 

TCAATCATAA TTGCCACTTC TTGTTCTTTC GATCCACGAT TTTCTTTTTC AGGTATAGTT 2 7 60 

AACATATACA ACCACTCCTT TATACTTTGT GAATTATTTC ACAAACATTA TAGTACATGT 2820 

CTCTCAGGAT ATAAAGAAAA T T C T AT AC AA AAAAGTTTAA TTTCGAATAT TATTTGAACA 2 880 

AATATCAAAT TTTAAAATAA ATGTTTTCAT GffiATCATTG TTATTTCGGT GTTTTTAGAA 2 94 0 

TGATTTTATA ATCATAATTT TTTCAATGAC ATAATTTATT CATAATTATA TATTTAATTC 3000 

TGAATATTCT TTAATAGCCT AAAAAGCTTG TTATATCGCA CAAAACACAA CATTAAAAGG 30 60 

TCCTTGTCGA TACCTATCAA C AG AT G T T AC AAATAAAAAC CaCCCGTGTG AACGGGTET 312 0 

TTGTTCTGCG gCTATAAGCC TTCCTTACTG GCCaGCCCTA AAAGGGC AC T GACAAGTCAG 3180 
CCAACTGCAC TACTATTCCA GCAATCCTAA AGGTTTACTC TTTTTTCTTT CTTTTTTTAT 3240

TTTTCTCTCC AGTGAAAGGA TCTAAATATT CTTCCATTGA AATTTCATCT GCAACGATAT 3300 

CCTCTTGTAA TTGATTACGA ATATAATTTT CAATCACTTT TTTATTTCTA CCTACTGTAT 3360 

CCACATAAAA TCCTTTACAC CAAAACTTTC TATTTCCATA TCTATACTTT AAGTTAGCAT 3420 

GTCTATCAAA TATCATTAAA TTACTTTTCC TTTTAAATAG CCAACAAATG ATGATACCCC 34 80 

AAGTTTGGGT GGTATACTTA C T AAC AT AT G GATATGATCT TTACATACCT CTGCTTCAAT 3540 

TATCTCTACA CCTTTTCTTT CATATAATTG ACGTAATATA ATCCCTATAT CTTTTTTTAT 3600 

TTTTCCATAT ATCGCTTGTC TTCTGTATTT AGGTGCAAAG ACAATATGTT ACTTACAATT 3660 

CCATTTCGTA TGTGCTAAAC TGTTTGTGTC AGATGACATT AAATAGCATC TCCTCGTGTT 3720

GATTATTTTG GTTGGCTGAC CAATATTTAC T C T AAC AT G T AGAGATGCAT TTTTTTGACA 3780 

ATGGTAGAAC CTTTTCTGGG GAG T G G G AC A GAAATGATAT TTTCGCAAAA TTTATTTCGC 38 4 0 

CGTCCCACCC CAACTTGCAT TGTCTGTAGA AATTGGGAAT CCAATTTCTC TTTGTTGGGG 3900 

CCCCGCCCCA ACTCGCATTG CCTGTAGAAT TTCTTTTCGA AATTCTCTGT GTTGGGGCCC 3960 

CTGACTaGAA TTGAAAAAAG CTTaTTaCAA GCGCATT 3997 
(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

GnGCGAGACA AACACACtTA TTGGTGCCAT TATmCcTAGA ATGaATTCaT ATGCAGTAGA 60 

TGAaaCAATC AAAGGATTGG CAAAACAATG CCAAAAATAT GATCaCAAT TAATTTTAAA 12 0 

TTACACAGGT TTAAATATCG AAGCAGAAAT ACAAGCGCTT G AAAC AT TAG CACGCaGTAA 180 

AGTAGATGGT ATTGTTTTAA TGGCTACAGA CATAACAGAG AGACATATTG AAGTCATTAA 24 0 

TAAAATGAAT GTACCAATCG TTATTGTTGG T C AAC AAC AT G AAC AAC T T C ATAGTATTGT 300 

GCATGATGAT TATAAAGCAG GTCAAATTAT AGGCGAATGG ATTGGTCAAC AGGGATATCA 3 60 

ACAAGTTGAA GTGTTTAGTG TAAGTGAAAA AGATATTGCA GTTGGTATAC AT AG AAAAC G 4 20 

TGGTTTACTT GACCAGTTAG CTAAATACCA AATTAAACCT AATATACATG AAACaAATTT 4 80 

TACTTATGTG G AAG C AC AAA AAGATGTTGC AAATGTTTTG GAAAATGTGG AGC AAG TAG A 54 0 

TGCGGTTGTT GGAGCAACTG ATACGATTGC ATTAGCTGCC TATAAATATT ATTCTGATAA 600 
AAAAGATGTT ATGAAACCAC ATCAAATATA TGGTTTTGGT GGTGACCCAA TGACACAATT 660

AGTGTCTCCA TCGATAAAAA CAATTCATTA TAATTATTTT GAAGCTGGCCAATGCGCGAT 720 

GGaAGAGATA CAACAGATGC TTAAAAAGCA AGATATGCCA TATAGCGTCA CAGTAGATGT 7 80 

TAATATTTAG ACGCTGTATT TTTTAAAATA AATGTGGAAC CGATACCATA T AAC TAT AAA 840 

TGGATAGGTT AAAAGTTAAA GAACGTAGGT AAAATTTGCT ATAATAGAAT ATAAATTGTT 900 

AACAGCATAA AT T AT AAAAG GAGGACTGGG TAAATATTAT GACCGAATGG AC T AG AG AAG 960 

AACGTTATCA ACGAATCGAG GACGTTGATA CTGAGTATTT TAAAACATTA AAACAACAAG 1020 

TTGATCAATC AAAATTTCGT CAACAATTTC ATATACAACC AGAAACAGGC TTATTAAATG 108 0 

ACCCCAACGG ACTTATTTTT TATAAAGGGA AJTATTATGT T T C AC AT C AA TGGTTCCCAT 114 0 

TAGGCGCAGT ACATGGCTTA AAGTATTGGT ATAACTACAC GAGTGATGAC TTAATAAACT 12 00 

TTAAAGCTGA AGGGCCAATT TTAAATCCAG ATACTAAATA TGACAGCCAT GGTGTATATA 12 60 

GCGGTAGCGC TTTTGAATAT AACGGGCATT TAT AT TAT AT GTACACAGGA AATCATEAG 1320 

ATAATCATTG GCAACGACAT GCGAGTACAG ATGATCGCAC GATTGAAAGA AGACGGTTnC 138 0 

AGTTGGnAAA A 1391 
(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

ATTTATTTTA ATGTTTATAT T T T C T AAC AC TTTTTTATGA TCATAGTAGT AATTGACATT 60 

TTTCAATTCA AAGACTGGTG TCATCGTATc TCACCTCGCA T T C AAC TATA CAACTCCTAG 120 

TAACATATGT AAACAGTAAT GTTTACGACT CAAAATTAGA CAAAATAAAG AGATATGCCC 18 0 

CCTTCAAGTT TTATTTATCG CATTTCTTGA AGAGAGCATT ATCATTTTAT TGTTGCATAA 24 0 

CCTTATTTTT TAATTCTGGG T CAAAT TGCT GTTGTTTAA CATTTCAATT TCAAGTTTAT 300 

ATGGCGGTTT TTTATTTTTC TTATCTTCAC CAACATAAGG TGTTTCTAAG ATTTTCGGAA 360 

TATCTTTAAA AC TAT CAT GA TGCACAATGT AATTTAATGC AT C AAAAC C A ATGTAACCGA 4 20 

AGCCAATATT TTCATGTCGG TCTTTTTGAG CGCCACGGTC ATTTTTAGAA T CAT T G AC AT 480 

GAACAACTTT GATTCTGTCG ACTCCAATGA TTTTATCAAA TTCATTTAAT ACGCCATCAA 54 0 

AGTCCTCTTT AACATTATAT CCAGCATCAT GCGTATGACA TGTATCAAAA CATACTGATA 600 

AACGTTCGTT ATTATGAACT CCATCAATAA TACGTGCTAA CTCTTCAAAT GAGCGACCAA 660 
TCTCTGTACC TTTACCTGCC ATCGTTTCAA GCGCAATACG TACATTATTG TCATTCGTTA 720

AAACTTCATT TAATCCTTCA ATAATCTTAT TAATTCCGGC AT CAACACC A GCTCCAACAT 780 

GCGCACCTGG ATGTAATACa ATATCTTTAG CCCCTATAGC TTGCGTTCTk TCaATTTCTT 840 

GTTGCAAGAA ATCTACACCA AGATTAAACG TTTCTGGTTT GCETGTAttG CAATaTTaAT 900 

GATGTATGGT GGCATGAACA ACAATATTAG 930 
(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

TGACGCACCA ATTTATAACG CAATTGACAA AACAATTAGA TATACCTGTG AAATTTGTAC 60 

CTGGAAACCA TGATTTATGG GAAGTTGAAA GTATGACTAC GCAAGACATT TGGAATAATT 12 0 

ATAAGAGTAT GTCACAGTGC TTGGTAGGAA AACCATTTAT AGTAAATGAA GAATGGGCAA 180 

TCATAGGACA TACTGGCTGG TATGATTATA GCTTTGCAGC ACAACGATTT T CAT TAG AT G 24 0 

AG T T AC AAAA AGGAAAACAT TATGGTGCGA CTTGGCAAGA TAAAGAACGA ATATCTTGGG 300 

GCATATCAGA TCAAAATTTA TCTAAAATAG CGGCTGAACA AG T G AAG AAA GATATATTAG 3 60 

AAGTAGGAAA TAGACGAGTG ATTTTAGTCA CACATGTTGT GACGCACCCT GATTTCATTG 4 20 

TTCCTATGCC GCATCGTATA TTCGATTTTT ATAATGCATT TATTGGGACA AGTGATTTCA 4 80 

ATCCTTTGTA TGCGATGTTC GATATACCAT ATAGTATTAT GGGCfiTGTT CATTTTCGTA 54 0 

AAAGTGTGAT AGATGATGGC AGATGTTATC TCTGTCCGTG TCTAGGCTAT CCAAGACAAT 600 

GGCGTTCAGA AGATATTTAC CAGGAAATAA AT GAGACGAT ACAAATAATA GAAATTTAAA 6 60 

ATGCGCAAAC CTGACCCAGT TTGCGCATTT TATGTTTTAC ACACGCGAGT AATGTGTTTA 720 

CTTACGTGTG TTTATTTTGT TGCTGATTTT CAATTGTATA TGAATGTGGT TGCACATAAA 7 80 

TGCACTTTCT TCCTGGTGAA TTAAAGCTGT ATTCCATTTT CTCTTTACGA ATTTTAATAA 8 40 

TTTGTTTGCG ATTTGGAATG ATGGCAGGTA AAACTAGGCC ACGACGAATA TGACTCCAAA 900 

GTGGATCGAT ATAATAGTAA TAGTAAXAT CGTACCCAAT GAGTAACGTT ACGTGAATAT 9 60 

TTGAAACTTG TTTCGCTGGC TTGTTATCAA AGCGGAAAAC ACGTAGTAAT GGTTTAGAAC 1020 

C AAG AT TAG T ATGGTATATT AACACAGGTT GACCTTGATC GATAATACCT TTAAGATCTT 1080 

CTAACGATTT ACCAGTGCCG TCTACGATAT TAGGATTGTA TTTTTGTAAA ABGGTACAT 114 0 
ATGCTTCTGG AAATATCGTT TGATGATAAT TGCCAAGCTT AATGAATAAG TGATGTCCAA 1200

CATAACCTTT ATGTGGATTG TTCGGATGTG TCGGCCAATG TCTCATAATT TCTGTAGCAG 12 60 

GGATATGTTG GTTGTTGTAT TGCAACATCA TGGCTGCGGA AACACCTTCA CACCCCATGA 132 0 

CCATAGGGAT AGGAAATAGC TGACTGATAG GTTTAACTGG TAATATTTTT CGGTTCATAA 138 0 

TATAGTCCTC GCATTGATTC AATAAATATT TAATATAATT ATATAGCGTC AATGCAAAAT 14 4 0 

GTCCTAAACA TATGTTTTAC ATGAGTGAAT AAAATTAATG GAGTGATAAA ATGGAATATC 1500 

AATTACAACA ACTTGCGTCG TTAACGTTAG TAGffATTAA AGAAACG T AT GAAAATGGAC 15 60 

GACAGGCTCA GCAACATATA GCAGGGTTTT GGCAAAGATG TTATCAAGAG GGAGTAATTG 162 0 

CGGATTTACA GTTAAAAAAT AATGGTGATT TAGCCGGGAT ACTTGGCTTA TGTATACCTG 168 0 

AATTAGACGG TAAGATGTCA TATATGATTG CAGTTACCGG AGATAATAGT GCTGATATTG 17 4 0 

AAAAAT AT G A TGTCATAACA TTAGCAAGTT CAAAGTATAT GGTATTTGAA GCACAGGGCG 18 00 

CAGTACCTAA AGCAGTTCAA CAAAAAATGG AAGAGGTTCA T C AC T AC AT A CATCAATATC 18 60 

AAGCAGATAC GGTAAAATCA GCACCATTTT TTGAGTTGTA TCAGGATGGT GATACTACAA 1920 

GTGGAAAATT AATATTACCA GAAATTTGGG ATnCCTGTTA AAGGGGTGAT TGAAATAnGA 198 0 

AnTG 1984
(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

GATTCCACGT GTGTTAAAAG AAGTTACAcC TTCAATGATG GTATTTACTA ATTTCTTTAG 60 

AGATCAAATG GATCGCTTCG GTGAAATTGA TATTATGTT AATAACATTG CAGAGACAAT 120 

TAGTAATAAA GGCATCAAAT TATTGCTAAA TGCTGATGAT CCATTTGTGA GTCGTTTGAA 180 

AATCGCAAGT GATACGATTG TGTACTATGG TATGAAAGCA CATGCCCATG AATTTGAACA 24 0 

AAGTACGATG AATGAAAGTA GATATTGTCC AAACTGTGGT CGCTTATTGC AATACGATTA 300 

TATTCATTAT AATCAAATTG GTCATTATCA CTGTCAGTGT GGTTTCAAAC GAGAGCAAGC 3 60 

AAAATATGAA ATATCAAGTT TTGATGTGGC ACCGTTTTTA TATTTAAATA TCAATGATGA 4 20 

AAAATATGAT ATGAAAATTG CAGGTGACTT TAACGCTTAT AACGCGTTAC AGCATATACT 4 80 

GTTTTAAGAG AGCTAGGGTT AAATGAACAA ACAATTAAAA ATGGCTTTGA AACGTATACA 54 0 

TCAGACAATG GTCGTATGCA GTACTTTAAA AAAGAACGAA AAGAAGCGAT GATCAATTTA 600 
GCTAAAAATC CTGCAGGAAT GAATGCAAGT TTATCAGTTG GTGAACAATT AGAAGGCGAA 660

AAAGTGTATG TTATTTCGCT AAATGATAAC GCTGCAGATG GTCfiGATAC TTCATGGATT 720 

TATGATGCAG ATTTTGAAAA ATTATCTAAG CAACAAATTG AAGCTATCAT CGTGACAGGT 780 

ACACGAGCAG AAGAACTTCA ATTGCGATTG AAGTTAGCAG AGGTTGAAGT ACCAATTATA 84 0 

GTTGAGCGTG AT AT T T AT AA AGCAACGGCA AAGACTATGG ATTATAAAGG TTTCACAGTT 90 0 

GCAATACCAA ACTATACATC ATTAGCGCCT ATGCTTGAAC AATTAAACCG TTCGTTTGAA 960 

GGAGGTCAAT CATAATATGC ATGAATTGAC TATTTATCAT TTTATGTCAG AT AAAT TGAA 102 0 

TTTATACAGT GATATAGGAA ATATTATTGC TTTAAGACAA CGTGCTAAAA AACGAAATAT 108 0 

TAAAGTTAAT GTCGTAGAAA TCAATG\AAC AGAAGGTATT ACCTTTGATG AATGTGATAT 114 0 

TTTCTTTATC GGTGGTGGAA GTGATAGAGA ACAAGCATTA GCAACAAAAG AATTAAGTAA 1200 

AATTAAGACA CCACTTAAAG AAGCGATTGA AGATGGTATG CCGGGATTAA CGATTTGTGG 12 60 

AGGCTATCAA TTTTTAGGGA AAAAATATAT CACGCCTGAT GGTACAGAAT ffiGAAGGGTT 132 0 

AGGTATTTTA GATTTTTATA CTGAATCAAA GACAAACCGA TTAACAGGAG ATATTGTTAT 1380 

CGAAAGTGAT ACTTTTGGAA CTATTGTAGG TTTTGAAAAT CACGGTGGTA G AAC AT AT C A 14 4 0 

TGATTTCGGT ACACTTGGTC ATGTTACTTT TGGTTATGGT AATAATGATG AAGATAAAAA 1500 

AGAAGGCATT CATTATAAAA ATTTATTAGG TACTTATTTA CATGGACCAA TTTTACCTAA 15 60 

AAAT T ACGAA ATCACTGATT ATCTGTTAGA AAAAGCTTGT GAACGTAAGG GTATTCCGTT 162 0 

TGAGCCTAAA GAAATAGATA ATGAAGCGGA AATACAAGCG AAACAAGTAT TAATAGACAG 1680 

AGCAAATAGA CAGAAGAAAT CTCGTTAACT CTtAACATCG CATCAATGGA TTTAATATTG 17 4 0 

ATAAACGATG AAGTTTAGTA ATTAATCATA TATGTATAAA CACACACATT ATTTTGGATG 18 00 

GAAACAACCA AATTGATGTG TGTTTTtTTG TTCTAGTGAA TAATTATTAT AC AAT GAG T A 18 60 

TCTATCCTAG AATTATCAAT AGTAATGGTG ATTATGCAAC ATGAAAAAAT GAATGATG&. 192 0 

AGGAATTTGA CGATGAAGCC TACTAAAGTG AT AT T AAAAG ATGCATCTTA TTTACATAGC 1980 

AAAACATCGA T AAC ATT TAT T T T AAAAG AT GTAGTTATCG AAGAAGATAA TAAAATTTAT 2 04 0 

TATTTCGACA CTAGTGCAcT TTCGAAGATC AAGAAGTTAA ATTTGAATTT GCACTCTTTG 2100 

ATAGTGACAT GGACAATTTG AAACATATGG ATTGCGACAA TCCTGTTACT G AAAT AT ATT 2160 

TTATAGAACC TGATTTACAT TTTACAATTA TTGATTTTAA TCAAGAACTG CTTTGTATTT 2220 

ATATTGATTT TGATTCTGGT TTAAGGCATT CAAACATGGC AAC AG AAT C T GGTATTTCAT 22 8 0 

TAAGGATAAA TGTTGCTAAA TCAGATTTTA CTAAATTTAT TAATGAATTA GCCTCTTTAC 2 34 0 

ATTAATGATT TAAATCTGAT AT G T AAT T AC AATCAAAAAA GACAGCCACA TCCCTCCGTA 2 4 00 
GTTTAGGCGT GTGGCTATAT TTGAGTCTGA ATATTTATGC TTGTAATTTT AAAAAGGGAC 2460

ATGCTATATA CGATAAAAAG AGGCGGGGAC ATAAATCAAT GTTCTATGCT CTACGAAGTT 2520

ATATTGGCAG TAGTTGACTG AACGAAAATG CGCTTGTAAC AAGCTTTTTT CAATTCTAGT 2580 

CAGGGGCCCC AACAAAGAGA AATTGGATTC CCAATTTCTA CAGACAATGC AAGTTGGGGT 2 64 0 

GTGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA GGCAAAGCGA GTTGGGGTGG 27 00 

GACGACGAAA TAAATTTTAT GiAAATATCA TTTCTGTCCC ACTCCCATGG TGCCAATTAG 27 60 

CATAAGGTAC TTAAATTAAG CATATCTGCT GTCTAGCAGT CGATAAATCA TTAGAACTTC 2 82 0 

GTATAGTATA TGACTTTTAA TTTGATTTTC ACCACTAATT TCAAGTGCTT TTATAGTCGA 288 0 

ACGTAAAGTT TCTACAGAAT CATCTTCTCT CTTAAAAGAA CCATCATAA ATATATCTTT 2 94 0 

GATGCTACTA CTAATTTTTA GCAATGCCAT TTTTTCGTCA CCTGAAAAGT TAACACGAGT 3000 

ATTTTTAGGC AAGTAAATGA TATTTGATAA ATGAGTGATA AACAAACGAT TCGTATATGC 30 60 

ACGTTTAGTT AATTGATTGA GTAATTTCCA AT C AC AT T C T TTTTTCTTAT GATAGCTTAA 3120 

TTCATCACGT TGATAACTTA TTAACGTTTC AACTTGATTA TTTAAATTGA AAATATTTTT 318 0 

ATATGCTTTT TCGCTTTTAT CAGATTGCAG TCTTGATAAG ATAAGTTCTT GGCAGCGATT 32 4 0 

GTAAAATAAT TTATACATCA AGGCATCTGT CTTACTTAAT TTTTCTTCGA CCTGACCATA 3300 

ATACTTAGGT GGAAACACCA TGAAGTTAAT TAAACCTGAT GTCACGAGTC CAATAATTGC 3360 

TGTCAATGTT CGAGACAAAA AGTTGAATAT GTAGGCATCA TGAATACCTG GAAT CAT AGC 3420 

TAATGATGTT AGTACAGCGA CATTCGTACC AACTTGCAAT TTGAGTTTTG TACAGAATAA 3480 

AATCGTGAAC GTTGCACTCA ATGCATATGT AAAAGGTGAT TGATCGCCGA ATAASATGT 354 0 

AAATAATACT GCAAAGCCTG CACCAATTAC CGTAGCAGGT AATCTACGAT AACCTTTAAT 3600

AAGTGATGCC TTGGCAGTTG GTTCAATTGT GACTACAGCT GTTAAAATGG CATAGATGGG 3660 

TGTTAAATCT AGTGCCATAC AAAAGACAGC TGTTAAAAAA ATGGCAATAC CAGTTTTAAT 37 2 0 

TGTTCTGGCA CCAATTAAAT GTTTATACCA TTGATCGTTC ATTTTTTAAC CTCTAATCAT 3780 

CGTAAAATCT TAGCGAGCGC T T TAT AAT AA TAGTATCGTA CATTGGAAAA GTTCATGTAT 38 4 0 

GTAAAATATT TGAAATAATC ATACATAAGC ATTACTTTGA TTTTCATATA CATTAATCAA 3900 

AGTTCTGGGC T C C AC AAT G T TATATTGTTA GTAGTT&CT GAAATAAAAT GCACTTGTAA 3 9 60 

CAAGCATTTT TCAATTATAG TCCGGGGCCC CAACATAGAG AATTTCAAAA AAGAAATTCT 4 020 

ACAGGCAATG CAGGTTGGCG GGGCCCCAAC ACAGAAGCTG ACGAAAAGTC AGCTTACgAT 4 080 

AATGTGCAGG TTGGCGGGGC CCCAACATAG AGAAATTGGA T C T AC AAT T T CTACAGGCAA 414 0 

TGCAAGTTGG GGTACAACGA TAAAGAAATA TTTTTTCTTT ATCACACTAT GTCTCACTCA 4200 

CTTTCCAAAA TACTAAAGTA ACATCTTTAG TATATCAAAG AATTTTTGCT ATAATAAGTT 4 2 60 
ATAATTATAT AAAAAAGGAA CGGGATAAAA TGATTGTAAA AACAGAAGAA GAATTACAAG 4320

CGTTAAAAGA AATTGGATAC ATATGCGCTA AAGTGCGCAA TACAATGCAA GCTGCAACCA 4 380 
AACCAGGTAT C AC T AC G AAA GAGCTTGATA ATATTGCGAA AGAGTTATTT GAAGAATACG 4440 
GTGCTATTTC TGCGCCAATT CATGATGAAA ATTTTCCTGG TCAAACGTGT ATTAGTGTCA 4 500 

ATGAAGAGGT GGCACATGGG ATTCCAAGTA AGCGTGTCAT TCGEAAGGA GAT T TAG T AA 4 5 60 
ATATTGATGT ATCGGCTTTG AAGAATGGCT ATTATGCAGA TACAGGCATT TCATTTGTCG 4 620 

TTGGAGAATC AGATGATCCA AT G AAAC AAA AAGTATGTGA CGTAGCAACG ATGGCATTTG 4 680 

AGAATGCAAT TGCAAAAGTA AAACCGGGTA CTAAGTTAAG TAACATTGGT AAAGCGGTGC 4740 
AT AAT AC AG C TAGACAAAAT GATTTGAAAG TCATTAAAAA CTTAACAGGT CATGGTGTTG 4 800 

GTTTATCATT AC AT GAAGCA CCAGCACATG TACTTAATTA CTTTGATCCA AAAGACAAAA 4 8 60 

CATTATTAAC TGAAGGTATG GTATTAGCTA TTGAACCGTT TATCTCATCA AATGCATCAT 4 92 0 

TTGTTACAGA AGGTAAAAAT GAATGG5CTT TTGAAACGAG CGATAAAAGT TTTGTTGCTC 4 980 

AAATTGAGCA TACGGTTATC GTGACTAAGG ATGGTCCGAT TTTAACGACA AAGATTGAAG 504 0 

AAGAAT AG T T CAACATATAC T AAG AC T AAA GT AT GAAC AT CATTTAGTTC CGGAGCCTAT 5100 

TCATATTGGT TTCGGAACTG TTTTATAATA ATTAAGAACA CAATCAATGC ECATTTCAA 5160 

AAATATGTTG TAACAAAGTA GTTTTTAAGC AAACATATCA TCGACATCAA CGAAGATACA 5220 

TAGCGCATTT GGTATTTTAA AACTTATTAT AAAAGGTGAT AGTTAT GAAC TATGTTGAAC 52 8 0 

GTTATATTGA ACAGTTTTTG AGAGCAACAG TAAGAAATAA TATCAAGCAC TACCTTTTAA 53 4 0 

TGCTAGATGA AAAAATGAAA AATTTAGATG ATTATATGCG TTATTTAATT ACTAAAAAAG 54 00 

AACAACTTAG CAAGTTAATT G AC AG T C T AA TGCTAACATT AGAAAATAAA TATATTGATA 54 60 

TTGCTGAAGC ATTTCAAATT CAATGTGCAA GAGAAATCAA T AAT C AAG AA ATTGAAAATA 552 0 

TTAAATCAGA GTTGAATAAA GTTGAAGCAT ATTVTGCACA AATTGAAACT CAAATTCAAC 5580 

AAACTTCAAC TGAAAAAATA GCAACAGAAA AAACATCGTA TCTAATAAAT T AT AT GAAC G 5 64 0 

CTGTGGCATA GAAAGGCGGC G AAAC AT G AC ACACAAATAT AT AT CAACGC AAATGTTGAT 5700 

CATTTTTACT GCATTAATGA TTATTGCCAA TTTTTACTAC ATATTTTTTG AAAAAATTE 57 60 

CTTTTTACTC GTTCTATTAT TGGGATGTGT ATTAGTTTAT GTAGGATATC TTTATTTTCA 5820 

TAAAATACGT GGCCTTTTGG CGTTTTGGAT AGGCGCGCTA TTAATTGCAT TCACATTATT 5880 

GTCTAATAAG T AT ACAAT C A TCATCTTGTT CGTCTTTTTA TTATTACTTA TTGTGCGTTA 5940 

T T T AAT AC AC AAGTTTAAAC CAAAAAAAGT AGTTGCGACG GATGAGGTTA TGACTTCACC 6000 

ATCTTTTATT AAACAAAAGT GGTTTGGTGA GCAACGTACA CCAGTTTATG TATATAAGTG 60 60 
GGAAGATGTA CAAATTCAAC ATGGAATTGG CGACCTACAT ATTGACTTAA CAAAAGCTGC 6120

AAATATTAAG GAAAATAATA CCATTGTTGT TAGACACATT TTAGGTAAAG TGCAGGTTAT 618 0 

ATTGCCGGTT AATTACAATA TTAATTTACA TGTAGCTGCT TTTTATGGAA GTACTTACGT 62 4 0 

GAATGAAAAA T C AT AT AAAG TTGAAAATAA CAATATTCAT ATT G AAG AAA TGATGAAACC 6300 

GGATAACTAT ACAGTTAATA TCTACGTATC AACGTTTATC GGAGACGTAG AGGTGATTyA 6360

TCGATGAAyC ACT 6373
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

ATAGnGAAAG CGTTTTACAC TTAATAACTC CCTCTTAAAT GCATCCAGGT TCTATGTAGT 60 

AAAT CAT G AA nATAACATAT AAATnTAGAG GAGATTTACC TTTGAATACA GAGAACAACA 120 

AGAATCAAAA CCAATCTGTT AAAAATTCTG AAAGaCGCGG CATGTTAAAA GGATGCGGCG 180 

GTTGCCTTAT TTCTTTTATT T T AT T AAT AA TCTTATTATC AGCCTGTTCA ATGATGTTTA 240 

GTAATAATGA CAATTCCACT AAT AAT C AAT CATCAAAAAC GCAATTAACT CAAAAAGATG 300 

AAAATAAAAA TGAAGATAAG CCTGAGGAAA AAT C AGAAAC AGfiACAGAT GAGGATTTAC 360 

AATCAACCGA AGAAGTACCT GC AAAT G AAA AT AC T GAAAA TAATCAACAT GAAATTGATG 4 20 

AAATAACAAC AAAAG AT C AA TCAGACGATG ATATTAACAC ACCAAACGTT GCAGAAGATA 4 80 

AAT C AC AAG A CGACTTGAAA GATGATTTAA AAGAAAAGCA AC AAT C AAG T AACCATCATC 54 0 

AATCCACGCA ACCTAAGACC TCACCATCAA CTGAAACAAA CACGCAACAA TCATTTGCTA 600 

ATTGTAAGCA AC T T AG AC AA GTATATCCGA ATGGTGTCAC TGCCGATCAT CCAGCATATC 660 

GACCACATTT AGATAGAGAT AAAGATAAAC GTGCATGTGA ACCTGATAAA TATTAAACAA 7 20 

CAAGCGAATT GAATTCAAAT TGTATf TAGC TTTATGCACT AATCACATAG T AAAT AAT GA 780 

GGGAGATTTT TTAGGCATGA GCAATCAATT C AAAAG C G AA GAAGAGCGAA GACAATGGGA 84 0 

ACAATTCCAA GCTTTCCAAA ATCAACAAAA CCAACAGAAC CAGCAATACG GACAAAAGAA 900 

ATCTAAAAAA GGATGGTTCT GGGGCTGTGG TGGTTGTCTA GTATTATTTATTTTAATTAT 9 60 

CATCGGTATT TCAGCTTGTA CAGCTGGTAT TACAGGTAAC CTTGGCGGAA ATAGTTCTAA 1020 

AGAAACGAAC AAAACCCATA AAATCGGTGa AACTGTTAAA AATGGCGACC T T G AAG T C AC 108 0 

TGTAAATTCA GTGGAAACTA TGAAATCTGT AGGACCATCT CTTGCACCAA CAAACGCTAA 114 0 
AGGTATATTT GTCGTTGCTG ATGTGACGAT TAAAAACAAA GGTAAAGAAG CGTTAACAAT 1200

TGATAGTTCA ATGTTTAAGC TAAAATCCGG TGATAAAACA TTTGAAGCAG ATAATACAGG 12 60 

TTCAATGTCT GCTAATCAAA GTGACAATGG TAGTATAGAA AATTCATTTT TCTTACAGCG 1320 

TATAAATCCA GATAGCACTG CTCAAGGTAA AffTGTTTcG ATGTGTCAGA AAACATAGCC 138 0 

AACGCAAAaG ATAAAAAATT AGAAGTTATT TCTAGTTTAT TTAGCGTCAA GAAGATTACA 14 4 0 

TTTGATTTAT CCGATGCTAA AAAAAC AT C A AAAGCTAAAA AAGACAAGCA AGATACAGAA 1500 

GTAGCTGTTG CGAGTTCAAA TAGCGATAAT GTAAGTTATG AAGCTTCGGC TACT AC ACT 15 60 

GcTACAACTT CTAGTGCGGA TACTGATTCT GAAGATAGCG AAAAGTCTAG T AAAG AT GAG 1620 

GATAAGCAGA ATGCGTCTAA AAgTGATAAA TCTAGTGTAG AAAAAAGTGA ATCTAATGAG 1680 

GAAACTGCTC CTGTAGAGCC CATGCCCCAT AGCAAACCTA CCACTAGTGA aGCACCACCT 17 40 

AGCCAAAATA TTCACAaTGa AGATAGCmTG TACGACGCTT CAACAGAATA AAATtnyCAG 18 00 

tAGCTCGGCT ACCCTTCTTT TACGGAAAAA TTAATTATAC ATAATCaAAT CaAGGAGATA 18 60 

AAAAAATGAA ATTCAAAGCT ATCGTTGCAA TCACATTATC aTTGTCACTA TTAACTGCCT 1920 

GTGGTGCTAA T C AAC A T AAA GAAAATAGTA GTAAATCAAATGACACTAAT AAAAAGACGC 1980 

AACAAACTGA CAACACTACA CAGTCAAATA CAGAAAAGCA AAT G AC AC C A CAAGAAGCCG 204 0 

AAGATATAGT TCGAAACGAT TACAAAGCAA GAGGCGTTAA TGAATATCAA ACATTAAATT 2100 

ATAAAACAAA TCTTGAACGA AGCAATGAAC ATGAATATTA TGTTGAACAT CTAGTCCGCG 2160

ATGCAGTTGG CACACCATTA AAACGTTGTG CTATTGTTAA TCGACACAAT GGCACAATTA 2220 

TTAATATTTT TGATGATATG T C AG AAAAAG ACAAAGAAGA ATTTGAAGCA TTTAAAAAGA 2280 

GAAGCCCTAA ATACAATCCA GGTATGAACA AT CAT GAT GA AACAGATGGT GAGTCAGAAG 234 0 

ACATTCAACA T CAT G AC ATT GATAATAACA AAGCAATTCA AAATGACATA CCAGAT CAAA 2 4 00 

AAGTCGACGA T AAAAAT GAT AAAAATGCTG TTAATAAAGA AGAAAAACAT GATAATGGGG 2 4 60 

CAAATAATTC TGAAGAAACT AAAGTTAAAT AATGGCATAC TTTGATTAAT CGTAATTTTT 2 52 0 

AG G AG AAAC G CTATGATAAC AT T T GAGAAT ATACAACAAC TTGAGAATA CACCTTAATG 2 58 0 

ACTATGCATG GTCTTTTTAA TCAACTTAAA CTCGGCATTA TTTCAATCGA AAACGCAGAG 2 64 0 

CATACGCTTT TTACACCTTA TATGTTGGAA ACGCTCTCTT CCCTAGGCGT G AAAG AC AG C 2 7 00 

ATTGTCGATT TAATTCATAA AGGGACTGAA TTAGAAGACT TTGCGGCATT TAATTTATCA 27 60 

ATTGAAGACA CAGTTACAGT CTGTTTACAA AGAACTGAAG AAC TAT T AAA ACAATACAAA 2820 

AATGTGGAAT TCAATGACAA AATATTAATC AATTGGCGTA TTATACAAGA GAAATAGACA 28 80 

TATAAAAGTC GAAT G T AAC t ACGTGAGTAT TGATTTTATT CTTTGTAAAT TACAAGCATT 2 94 0 
TCATATTATA AAGTTTGAAA AGAGGTATff TGAAATGGAG AAAAATGAAT ATATAGCTAA 3000

AT AT AAT GAA TATAGTCAAT TATTAGACGC TACATACTCG CAAGCTGTAG CATmCCTTTT 3060 

AAGtAAATaT GGCGCTGTAA CCGATGATTA T T AT AAAG a A AAAT CAT AC A CGCGATTTTT 3120 

AAAt GGAGnA ATCAAAAGTA TTTCAAAAGG AAAATACACT AGAGCTAGCG Aa GST TATA 318 0 

TTGCCATCAT ATAAGCGAGG ACAAATTCCA AAATCTATCT GATCTAAGAT TCATtTCCAA 324 0 

ATTTAAGTAC TCATACGACG TTCAAAAGAA AGAAAACTTA GTGTACTGTG ATCTAATCGA 3300 

GCATTTAATT TTACATGCAA T TAT T AC AAA AGAATCCCAT GGCCAATTTG GTGTAGCTGG 3360 

ATTATGTCAA AT GAT C AAAC CAACAGTCAT TGATTGGTAC ATTGGCGAAT ATAATCCAAA 34 20 

ACCAGCATGG ATGCAAGCCA C C AAAG C AC G TGCCTATTTG CCTGGAATAT TAGTAGAGAA 34 8 0 

ATTACTCATT AAAATTGACG ATATGTTAAA AGGAATAGAA ATATAAGATT TCCTTGAGTC 354 0 

TAG AT AAAT G ATTAATGTAG ATTTATTTTT TGCTGTGAG ATTTTGTTAT AGATGTTTAA 3600 

ACCTGTAATT AAATATATTT TATAAAATAG AC C AC GC AT A CCTATCTATA AACGGrCAAT 3 6 60 

GTTTATAAAT GAGTTTGCAT GGtCTTGAAT TGTATTAAAT TTCTTTTGGT TTTAATAAAT 3720 

CGACTAGATT TTCACAATAT T T AT C AAAT A TGTATTCCTA AAT TAT AC AG CCTTAATCCA 37 8 0 

GCAGCTACTT TCGAAACTTC CAACTTAGTT GATATAAGGT TCAATAGTTT GTTTCGTTCT 384 0 

T T T T C AG AT A AACCAGAACT T AAAT T GAT A TTATTGACTT CATAAAAATT ATAGACTAAT 3 900 

GCCTCTATTT GCTTTTTAGG CATAAGTAAG TCGACTGAAA ACTGATTTAC GTCGCTTTCA 3 9 60 

TAAATCATTT CATGTAAATT CTTTAGACTA TTATCGTTAC TATCTCTCAT TAAGTCTGTA 4 02 0 

TTTTTAAATA AATAACGGCC CAATTCACGA GCTATTGAAA ATCTTGTATT ATTAATCGAG 4 080 

TGATTATTAT TAATATAGAT TGTTCTTCCA CTTAAATAAC CCGAAGTATT ACCCTCCATT 4140

TTAATATATC TAACATTTAA ATTAAGTTGA AATAATAGCT TGCTATGTC AATAGCAAAG 4 2 00 

TGTTCAGAAG TAATAAAAAG TTGATCCATT TTGTCCTTTA TAAATGCCTG AAATAATCGA 4 2 60 

ACTATTTTTG ATTCTAAAAT ATCTTCATAA TGAACTTTCT CAATAACTTT CAATTGATTC 4 320 

CCCACCCTAA CTAAAAAAAT CGAACC T AC A TCATTTAATA AATTTATAAC TATAGGTGTA 4 380 

AACATTATTA AAATAAAACC CCTCTACTAC TATATGTAAC GAAGGGACAT GATTTCAAAA 4 4 40 

TAAAATACCT TTTTTATAAA TnTATTATAA TATCCCCCAC TATACnAC 4488
(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

TATGGCGCCA TATTAGTTGT AACTGGTTTA AGAGGTCcAa GAAaTATCaA ATAAAGTTGT 60 

tCCTGGGCTT GGTACTGTTA TCTCaATATT GmwTGCaTTT GGTGGTCTAG CTTTTAATAT 120 

TGGTAATATT GCTGGTGCCG GTTTAGGTTT AAATGCAATT TTTGGATTAG ATGTAAAATG 180 

GGGCGCAGCT ATTACTGCAA TCTTTGCAAT ATTAATCTTT GTAAGTAAAA GTGGCCAAAA 240

AATTATGGAC GTTGTTTCAA TGATTCTTGG TATTGTGATG ATTTTACBTG TGGCATATGT 300

GATGTTTGTT TCTAATCCAC CTTATGGTGA TGCTTTTGTG CATACATTTG CGCCAGAACA 360

TCCAATGAAA TTAGTCTTGC CCATCATTAC GTTAGTTGGT GGAAcTGTar GTGGTTATAT 420

TACCTTTGCA GGTGCACATC GTATATTAGA CTCTGGCATT AAAGGTAAGC AATATTTACC 480

ATTTGTAAAT CAATCAGCAA TTGCTGGTAT TTTAACTACA GGTATTATGA GAACGTTACT 540

ATTCCTAGCG GTATTAGGAG TTGTTGTAAC AGGTGTGACA CTAAGTTCTG AAAATCCACC 600

AGCGTCAGTT TTTGAACACG CAATTGGACC AATTGGAAAG AATATTTTTG GTATTGTGTT 660

ATTTGCTGCA GCTATGTCAT CAGTAATTCE CTCAGCATAC ACAAGCGCAA CATTTTTAAA 720

AACACTTCAT AAATCACTTA ACGAAAGAAG TAATTTAATT GTGATTGTGT TTATCGTTAT 780

TTCAACAATG ATTTTCTTAT TTATTGGAAA ACCAATCAGC CTTTTAATTA TAGCAGGCGC 840

GATAAA 846



(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

GAATCGCACT CATTGTCTGT TTGTCATCAT TATAAATAGG CATATGTTGT ACCTCAAATT 60 

TCTTTTTAAA AGGTACTAAT ATTTCTTTAG TGAAAATTGA ATCACGGTCG TTTATTGGTG 120

CCTTGAGTAT ATTATTATAG ACGGAATCTG ATCTAATAAT ATTGATTTTA TACATGATAA 180

ACCTCCTTAT GTTGTCAGCA TAAAGGATAA CGTAACGTGA TTTTCAAGCA GTAATTGTAA 240

CTAATTGAmA AAAATTAAGA AAAGTATGTG AGTGTTCCTA AwTAATATGa TTAAAATGAT 300

GGCGAATAAG TGTCTaAAAG CATCTTAAAG GGACATTGTA TAGGGTAAAT CACTTCATAA 360

ATAAGGGaAA ATCCTTATGT TCACTTTTTC JCAATCATnA TAAAATATAT ATGTAGTCAA 420

TACTTTGTCT ATATTGAATG TTTTCATATA AATGAAAGCA TTTTTAAATA ACATTGACCT 480

CTAATATATA GGCAGAGTAT TGATATCTAT TAAAAAATAA ATGATTTTGA TGAAGGTGAA 540

ACGTATGTAC AAAACAAAAG GTGGCTTTCA ACTTACATTA CAAACATTAA GTTTAG3GT 600

TGGGTTTATG GCTTGGAGTA TAATTGCGCC ATTAATGCCC TTTATTAAAC AAGATGTCAA 660

TGTTACTGAA GGTCAAATAT CAATCATTTT AGCGATACCA GTTATTTTGG GATCGGTGCT 720

CCGTGTGCCA TTTGGTTATT TAACAAACAT TGTTGGCGCT AAATGGGTAT TCTTTACTAG 780

TTTTATCGTA TTGTTATTCC CGATATTTTT CTTAAGCCAA GCACAAACAC CGGGTATGTT 840

AATGGCTTCA GGATTTTTCC TTGGTGTAGG TGGTGCAATT TTCTCAGTTG GTGTTACATC 900

AGTTCCTAAA TATTTCCCTA AAGAAAAAGT AGGTCTAGCA AATGGTATTT ATGGTATGGG 960

AAATATCGGT ACAGCAGTTT CTTCATTTTT AGCACCACG ATAGCGGGTA TTATTGGTTG 1020

GCAAACAACA GTTAGAAGTT ACTTAATTAT TATCGCTTTA TTTGCATTAA TTATGTTCAT 1080

TTTTGGTGAC ACACAAGAAC GTAAAATTAA AGTACCATTA ATGGCtCAAA TGAAAmCATT 1140

ATCTAAAAAC TACAAATTAT ATTACTTAAG TTATTGGTAT TTTATTACTT TTGGTGCTTT 1200

TGTAGCATTT GGTATTTTCT TACCTAACTA CTTAGTAAAT CATTTTGGAA TTGACAAAGT 1260

AGATGCTGGT ATTCGATCAG GTGTATTCAT TGCGCTGGCA ACATTCTTAA GACCAATAGG 1320

TGGCATTTTA GGTGATAAAT TTAATGCAGT TAAAGTATTG ATGATTGATT TTGTTGTTAT 1380

GATTATCGGT GCCATTATTT TAGGTATTTC AGACCATATC GCATTATTCA CTGTAGGCTG 1440

TTTAACAATA AGTATTTGTG CAGGTATTGG TAACGGCTTA ATCTTCAAAT TAGTACCATC 1500

ATACTTCTTA AATGAAGCGG GATCCGCAAA TGGTATCGTA TCAATGATGG GTGGTTTAGG 1560

AGGATTCTTC CCACCACTAG TAATCACGTA CGTAGCTAAT TTAACBGAT CAAGTCATTT 1620

AGCATTTATT TTCTTAGCGG TATTnGGAnG TATTGCATTA TTTACCATGC GTCATTTATA 1680

CCAAAAAGAA TATGGCTCAT TGAAaAACGG TTGATATGTA ATACATGCCA TTcATTTAGT 1740

TAAATACAAA GCCTTaATAT CATGCGCAAT ATTCGTAGCA TGACATTAAG GCTTTAGTAG 1800

TGTATTCATT TTAAATTTAA CTATGAAATT GGAATAGATG TTGATGAAGT AGTTTCATTG 1860

CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 1920

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 2040

TTAttATTGA AATCgATACT AATAtCTccC TG 2072
(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:

CGTCTTCTCT TGGTTATTCT CTGTGTTTTG TCTTTGTTCA AATTCGATTT TTTGTTGTTC 60 

GAATGAATGT AATACTTTAT TTTTTTTTGG CACATAATCC AAGTnATTT TAGGAATTAA 12 0 

TATACGGTCT TTAAATGCTT CTTGTATTTT GCTCACAATC AATTGGTATA GTTGCTCTTC 180 

TTTTGATAAA CGCACTTCTA GTTTTGTTGG ATGAACATTT ACGTCTACTA AGATTGGATC 24 0 

CATTTCAATA TTAATATAAC AAATCGGGAA CCTACCTATT GTTAAGAGTG TATGATAGCC 300 

TTCTAAAATC GCTTTATTTA GCATAAAGTT TTTAATGTAT CGTCCATTAA T AAAAA TAG A 3 60 

AATATAGTGC TTATTACTTC TAGAATGTTC AGGCTTTGCA ACAAAACCTT CAATGTGATA 420 

ATCACTTGTA TCTCCAGATA TATGTACTAA ATCTCGTGCA ACTTTCATCC CATAAATCTC 4 80 

TGCCATCACT TCATTAGTTC GTCCTGAACC ATTTGTACTT AACATTGTTT TGCCATCTGA 54 0 

AATGAGTGCT ATTCGAATGT CCGGATGGCT CATTGCCATT CTGTTGACAA TATCTGTTAT 600 

TTTACCTAGT TCAGTGTATA AACTTTTAAT ATATTTTAAA CGTGCTGGTG TATTATAAAA 6 60 

TAATGATTCT ACAAGTATAT CTGTTCCTTT TTTCGCTTTT GCAGGCTTAT GATTTMAT 720 

TTCACCATTT TCTACATATA TTTCATTTCC ATTAGCATTA TCCGTGCAAG TCTTCAATGT 7 80 

TACTTTAGCA ACTGATGAAA TACTGGCTAG TGCTTCACCA CGGAATCCTA ATGTCCTAAT 84 0 

ATGAAATAAA TCTTCATCTT GATCTAATTT ACTAGTCGCA TGTCTATGAA ATACTAATCC 900 

TAAGTCTTCC GCTTCAATTC CGCTTCCATT ATCGACTACG CGAATAGATT GGACGCCAGA 9 60 

tTCCTCTACT TCAATGCTTA TTTCTGTAGC GCCTGCATCT ATAGCATTTT CCAACAGTTC 1020 

TTTCACAACA GAACTCGGTC TTTCAACTAC TTCACCTGCT GCGATTTTAT TTGCTAATGA 1080 

GGTTTGGAGT TCTTTAATTT TCCCCATTTT GCAACACCTC TATTTTAATT GATTTTGTAA 114 0 

TTCACTTAAC TTCACCAATG CCTCAATTGG TGTCATATTA GATAAATTCA AATTTTTAAT 12 0 0 

TTGTAGTTCA ATCTCGCTTT TTTGATCATT T T C AAAC AAA TcAAATGATG CyTGTTCAAA 12 60 

GTCTTTTTGA GATAAAGTAT CaGTTGTTTC TTCaACACTT aAGTTTAAAT TTTCTTGATT 1320 

AATTTCCAGG TTCATTTTCG ACCATTTTTA AATTTGATAT CGATGATtTT TCACCAGCAG 138 0 

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG CTCTGCTAAT AACTTTTTCA GGTAAATCAG 14 4 0 

CTAATTTCGC AACTTGAATA CCATAAtATC GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500 

GAATATAAGT TCACCTTTAT ATTCATTAGC AGCGACGTGA ACATTTTTTA GACTTGGTAA 15 60 

TGCTTGATCT AATGTTGTCA ATTCATGATA ATGTGTTGAA AATAACGTTT TAGCATGTGA 162 0 

TGTTTCAGCT ACATACTCTA TCATTGCCTG CGCTAAAGCT AAACCGTCAT ACGTTGAAGT 168 0 

ACCACGTCCA ATTTCATCGA AAATAATCAA ACTATCCTCT GTTGSTAAG TTAATGCCTT 1740

TTGTGCTTCT AGCATTTCTA CCATAAACGT ACTCTTACCT GAAACCAAAT CATCTGCCGC 1800 

ACCTATTCTA GTGAATATTT GATCAAATAT AGGTAACACT GCCTCTTTAC AAGGGACATA 18 60 

AGCTCCCATT TGGGCCATTA TACTAATTAT GGCAACTTGT CTCATATATG TCGATTTACC 192 0 

AGACATATTC GGACCTGTAA TTAAATATAT AAATGTTTCA TTATCTAATC GACAATTATT 198 0 

AGGCACATAG TCATTATAAT CCATTACTCT TTCCACTACT GGGTGCCTAG ATTCCACTAA 20 4 0 

TTCTAATGTT TTATTTTCAC TAAATGAAGG CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100 

TGCAAAGCTC TGTAAACAAT CTAGCTCTGA AATAATTTTA GCTTGTTGTT GTAAACGTTC 2160 

AGTATATTTT TTAACTTCTT CACGTAGCTG AACAAATAAT TGATATTCTA ATTCGATGGC 2220 

TTTGTCTTCC GCACCTAAAA TGATATCTTC TTTTTCTTTA AGTTCATCAG T T AT AAAACG 2280 

TTCAGCATTC GATAACGTTT GCTTCCTCAT ATAACCAAAT TCACTTGGTT CAAATTTTG 2340 

CAAGTTGGCA CGTGTTATTT CTATAAAATA ACCAAACACT TTATTAAAGC TTATTTTCAA 24 00 

TGATTTTATT CCTGTACGTT GTCTTTCTTT GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 24 60 

GTTTTTTGAA GCTTCAAGAT ATTCATCTAA TTGCGTATTA AAACCAACTT TGAATAGTCC 2 52 0 

GCCATCTTTA ACTGAAATTG GTGGTTCTTC TACTAAACTC TGTTCTAATA TATCAAGTAA 2580 

ATCATCaAGG GGTTCTAGTT GATTAACTTG TACAAGAGTA TTCTGATTCA TAGAATTTAG 2 64 0 

TAATGCTTTA ATATTCGGTA TTTCAGAAAT GGAATGTTTA AGTTGAATTA AATCTCTCGC 2700 

ATTAACATTT CCGTAACTAA CACGCCCAAC AAG&GTTCA ATATCATACA CTTGATTAAG 2 7 60 

ATATGTTCTT AAGGTGTCTC TTTCTATGAA ATGAGCACTA AATTCATCAA CGATATCTAA 2820 

TCGTGCTTCA ATTTGTTCTT TACTTATTAG TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2880 

TGCTCCCATT GGTGTTTTCG TTTCGTCCAT TAGCCAAAGT AGCGTTCCTT TTTTTGATTT 2 94 0 

TAATCGAATG CTTTCCGTTA ACTCAAGGTT TCTCTTAGCA TAAAAATCCA TTTTCATATA 3000 

ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3060 

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 312 0 

C AC AC TAT AG ATTtCTGATG ATAACGTTTC CCTGACTGT 315 9 
(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

TAAGCGAGAA GCAATTGGTG TTATGTATGC TAGTGATAAA CCAACAGGTG AAAGTACAAG 60

GTCATTTGCT GTTTATTTCT CTCCTGAAAT TAAGAAATTT ATTGCAGATA ATTTAGATAA 120

ATAAATCATC CATCCATACA TTGATAAATG ATTTTyiSAA ATTAACAACA AAATCAACAA 180

TTTTAAACAT CTCTGTGATT CTATTTATTC GAAATGATTT AAAAAATAAA ACTTCAAAAA 240

CCTAACCTTA TATTTATACG AATACTTAGA GGAGCACAAA AATGAATAAA AATATAATCA 300

TCAAAAGTAT TGCAGCATTG ACGATTTTAA CATCAGTGAC TGGCGTCGGC ACAACAGTGG 360

TTGAGGGTAT TCAACAAACG GCTAAAGCTG AACATAATGT GAAACTAATC AAAAATACTA 420

ATGTAGCACC ATACAATGGT GTCGTTTCGA TAGGATCTGG AACAGGTTTC ATTGTCGGTA 480

AAAATACAAT TGTTACCAAC AAGCATGTCG TTGCAGGTAT GGAAATTGGT GCACATATTA 540

TAGCGCATCC CAATGGTGSA TATAATAATG GCGGATTTTA TAAAGTTAAA AAAATTGTCC 600

GTTATTCAGG TCAAGAAGAT ATTGCCATTC TACATGTGGA AGATAAAGCT GTTCATCCAA 660

AAAACAGGAA TTTTAAAGAT TACACAGGCA TTTTAAAAAT AGCATCAGAA GCTAAAGAAA 720

ATGAACGCAT TTCAATTGTT GGCTATCCAG AACCATATAT AAAfiAATTT CAAATGTATG 780

AGTCAACAGG AAAAGTGCTG TCAGTTAAAG GCAACATGAT TATTACTGAT GCTTTCGTAG 840

AACCAGGCAA CTCAGGTTCA GCTGTATTTA ACAGTAAATA CGAaGTtGTA GGTGTTCACT 900

TTGGTGGAAA CGGCCCTGGA AATAAAAGTA CAAAAGGATA TGGTGTTTAT TTCTCTCCTG 960

AAATTAAGAA ATTCATTGCA GATAACACAG ATAAATAAAT CCTTACATAG ATAAATGATT 1020

TTAAAAATTA ACAACAAACT CAACaATTCA AATCATCTCT GTGATTCCAT TTATTCGAAA 1080

TGATTAAAAA AAATAAAACT TCAAAAAGCT AACATTATAA TTATACAAAT ACTTAGAGGA 1140

GCAGAAAAAT GaATAAAAAT ATAATGYTCA AAAGTATTGC AGCATTGGAC GATTTTTAAC 1200

ATCCAATAAC TGGGTGTCGG CACAACCATG GGTTGAAG 1238 
(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCCCTAATCT 60 

TTCGGCAATT GTTTCTAGCG CAATACCTTT TTGTGCATCT GAATGCG3TA TTTCTATATT 120 

TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAkTCA GCTAAAATTT TACTAGCTTT 180 

GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 240

GTTTTCAATA GCATCATAAT TATCAACAAC TyTCAACGTA CCATTATCTA TGCGTCTTTG 300 

AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 360 

GTAAATGTCT AAATCTCTTT GTGGATCTTC AGTATAAATC GCACGACTCG TGTATACTTG 420 

ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG ATTTATTAAG 4 80 

GTGTGAAGTG CTCATTACAT TGAAAGTTX ATCACGTACT TCAGCACCAT TCAAACAAAT 54 0 

ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 600 

TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG 660 

TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATAGTTT 7 20 

TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 780 

TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 840 

TCTaACAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG TGGATTTCTT ATTGGCTCTT 900 

AAgTTTTTAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 960 

ACAATTGCAC GAT GAT T AAA ATTAGATTGT TGCAATAAAT T CAT T AAAC T GCcACTACAA 1020 

CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 108 0 

CCTGCGCACA TGACACATGG TTCTAAGGTT ACATAAATG TGCAACCTTC TAAACGCCAA 114 0 

CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 1200 

TGTAGTGTTT CTCTTaAATT ATGTGCTCTA GCGATAACTT CATCATCTTT AGTGATGATA 12 60 

GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATGTC 1320 

ATAAAATATA TATCATTTGT CATTTATGTC CAGATACCTC ACTTATGGTA CAATACTCAA 1380 

TGTCTATTTT AACATTTGGA GATGTGAATA T G AAC AAAC C TTTTATAGCA ATTGAAGGTC 14 4 0 

CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATG 1500 

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATCAGA CTTTTATGAA GATATCTCTA 15 60 

AATGGAGTTT T C AAAC T G AA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 162 0 

T AAC AC AAC T AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 1680 

C T AAAAAT AC TTTGAGTTCT GTTGAATTTC AGAAAT T C AG TAAATTTAT GATATTTTAA 174 0 

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 18 00 

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGAaGtG AtAcTGTaAg 18 60 

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 192 0 

TTAATCGATA CnACTTCTAT TGATTTTCTT AAAAATGAAC AAG AT T AC G A AGATATATTA 198 0 

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 204 0 

TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA sGCAAGCACT 2100

TGCAGATAAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160 

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 22 20 

ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 22 80 

AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340

AGAAGATTTC AAAAC AT AT T CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 24 00 

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 24 60 

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520 

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA GTTGTACGTA TCAATATTAA 2 58 0 

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 2 64 0 

AATTATTCAA AC AT AT C G AC AAGTAGATAC ACGATAAAAG ACTAAAGACA T AGCG T AT AT 2 7 00 

GTTTATATTC AATGTATATT CCATAGATAT TA"CGATTAT TTTATCAATT CTATCGAATA 27 60 

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2820 

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880 

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTT 2 94 0 

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3000 

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3060 

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CGTAACAATA 3120 

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180 

TTAATTTCAG GAATACGTTC ATCTACCAGC TTTTGACTTG TTGACTTGGA CCAGTCGGTT 3240 

GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 330 0 

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 3360

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTACAA ATCGTACCCC TATTTTTTCC 3420 

CAACTTCTAT CAAATTCAAT TAACGTACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 34 8 0 

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3540

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3600 

ATCCCCTTCA ATACTTAACA AT AAAC AT T G TAAACTTAAT TTATTACCAT GCTTCGCTTC 3660 

ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720 

TTTTATTAAT ATTTTGGTAC A^CGACTCTC CAACCATTTT TATCTTCTAA AGTACCATTT 37 8 0 

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 38 4 0 

ATAACGATTT CACGATCTTC GTATCTCAAT GTACCCACAG GTGAAATAAC TGCTGCAGTA 3900

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAXAA TTCATCGATT 3960 

GAAACGCGGC GCTCTTCGAC TTCATATCCT AAGTTTTTAG CTAATTCGAT AAT AG AT T T A 4 020 

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4 080 

ACGAAGAAAA TGTTCATGCT ACCAACTTCT T C GAT AT AT T TCTGTTCAAC ACCATCAAGC 414 0 

CATAATACTT GGTCATAACC TAATTTATTT GCAT TAGTTT GTGCTAATAA ACTTGcCGCA 4 200 

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC ATATTCATCT 4 2 60 

T C T AC AT AGA TTTTAGTTGG TTTTAAAGTT TCACCACCAT AATATGCACC TGAAGGAGAT 4 32 0 

AAAATAATTA ATAATTTATA CTGATGTGff GCACCAACGC CAAGTGCCCC TTCTGTTGCA 4 38 0 

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 44 40 

ATATCAACTA ATTGTTTTAG CCCCTCTAAC AATTCTGCTT CGTCTACTTG AGGCATTTCT 4500 

AATCGTGCTA ACGAGTTATT AAGACGCTTA AAATTTTCTT CAGGACGGAA AAGOGAACT 4 5 60 

TCCCCATCTC TTTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4 620 

TGTGCAGCAG GTGAAATTTC AATAGGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4 680 

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 4 74 0 

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4 8 00 

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4 8 60 

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4 92 0 

AG AC T AT AT C TTTTAGTATA AACGTATTAA TTTCA&GAG ACAAGTAATC TGTGTTTTAC 4 98 0 

T AAT AT AC T T TACATACAAA AAACTCTTTA CTTTAAAATG AACTAAGCTC GCGAATTCAA 504 0 

TAAGTATAAT GAAT AAT AT T AG AAT T CAT G CACTAGTTTA TTAAAATAAA GAGTAATTTA 5100 

AAAT AT CAT T CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 5160 

TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220 

AAATCCCCAT TCGCCACGTG AACAGCTTGT AT C AAT AG AA TCCGGCCAAC TATTAGCGAT 52 8 0 

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 534 0 

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAAT T AT AAC CATTTCTAGT 54 00 

TTCTAATTTA GCGTCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CATCATCCAT 54 60 

ATACATCATA T C CAT AT AC G TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 5520 

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCfiCTGGCT CTTTAACATG 5580 

CGAGATTAAA CCTGGGAATC T AAC AC T T C T TGTATCTACA CCAAAACGTT TGAAATAGTA 5 64 0 

T T G AC AC AAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 57 00 

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 58 2 0 

ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 58 80 

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CACGATCTGT 5 94 0 

TACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000 

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 6060

ACCAATTTGT CCTAATGCAC C AG T AAT CAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120 

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 6180

AATATAAAAG CGCTTTCTTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 6240

GAAATTCTAG TTGTATTAAT TGATTTTAGT AATTTATAGC GTTTATTATT GCT AAT TACT 6300 

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 63 60 

TTTTTTATAA C CAT GG AT AT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 64 20 

TTGTACTTnT TCCGCTTCTG GATT 64 4 4 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4721 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATC ACGTTCTTAT TACATAGATT 60 

TAAACCGAGA AATATTATTC AAACGCT7TT TATTGTATCG TTGATTTCTT TAGGACTTAT 120

GAT AG T GAT T CATTTCATCA C T G G AG AT AA TTCACGTGTG TATTTTGGGA C AG AT AC AC G 180 

ACTGCAAACT TTATTGCTTG GTTGTATATT AGCATTTATT TGGCCTCCGT TTGCTTTGAA 240 

AAAAG AT AT T TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGTTTTGC 300 

GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 3 60 

ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 4 20 

TTTATTTGCT AAATTTTTAA GTATGAAACC T T T AC T AAT T ATAGGTAAAC GATCATATAG 4 80 

CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 54 0 

AATACCGGTA TACGTTTATA T TAT AG AAA T TTTGTTAACA GCGTTAATGG CTGAAATTTC 600 

GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 660 

TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 720

CGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCCG AGAAGAAAGA 780 

GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 84 0 

TAAGCAGACA GCGAATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 900 

CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATACCAAATG CACAAATTGA 960 

TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 102 0 

CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 108 0 

AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA G AC AT AT AT T TAGTTTCTAT 114 0 

TAGAGTACCT AG AG AT TAT G AAGGTAGAAT AAATAAATTA ATTTATGAGG CAGCTGAAAA 1200 

GCGCTcTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 12 60 

TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GGCTGACTG ATTTGATTGT 1320 

AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 138 0 

TATTACATTA CTTCTGATAT T TAT TAT C AA AAATGATGTA TTTCATTAAA AGTTTAGTGC 144 0 

TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 150 0 

GTATATTACA TTCAATGAAG CTTTATTAGG AAC AG AT T AC AT TAT GAT AA CAAAGCCCGC 15 60 

AAGACACCTA ATCTCTGTTA TAGTTTGTTT TGTCGCAAAA CTATAAAAGT TATAATTGTT 162 0 

TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA AT GAC TAT AT 1680 

GTATAGAGTG TTTTGATTAT TGGCAGGATA TTTAATTATG AAAAAAATCG TTACAGCTAC 17 4 0 

AATCGCTACA GCAGGACTTG CCACTATCGC ATTTGCAGGA CATGATGCAC AAGCCGCAGA 1800 

ACAAAATAAC AATGGATATA ATTCTAATGA CGCTCAATCA TACAGCTATA CGTATACAAT 18 60 

TGATGCACAA GGTAATTATC ATTACACTTG GACAGGAAAT TGGAATCCAA GTCAATTAAC 1920

GCAAAACAAC ACATACTACT ACAACAACTA CAATACTTAT AG T TAT AAC A ATGCATCTTA 1980 

CAATAACTAC TATAATCATT CATATCAATA CAATAACTAT ACAAACAATA GTCAAACAGC 204 0 

AACAAATAAC TATTATACTG GTGGTTCAGG TGCAAGTTAT AGCACAACAA GTAATAATGT 2100 

TCATGTGACT AC AAC T G C AG CGCCATCTTC AAATGGTCGT TCAATTTCTA ATGGTTATGC 2160 

ATCAGGAAGT AACTTATATA CTTCAGGACA ATGTACTTAT TATGTATTTG ATCGTGTTGG 2220 

TGGGAAAATT GGTTCAACAT GGGGTAACGC AAGTAATTGG GctAACGCAG CTGCATCATC 2280 

TGGCTATACA GTGAACAATA CACCAAAAGT TSGTGCTATC ATGCAAACAA CACAAGGCTA 2 34 0 

TTACGGTCAT GTTGCTTACG TTGAAGGCGT TAACAGCAAC GGTTCTGTTC GTGTTTCAGA 2 4 00 

AATGAACTAT GGACATGGTG CTGGTGTGGT TACGTCTCGT ACAATTTCAG CAAACCAAGC 2 4 60 

AGGTTCATAT AATTTCATTC AT T AATCAAA TGTAAATCAA ATGACGTCAA TATTCTEAA 2520 

CATGAGAGTA TTGGCGTTTT TGTTTTATAT AAATATAAAT GAGAGCGGTT TATTCACTGA 2580

TCTTTAGGGA ACTAAGTAAT AAAGTGATAA TTTATACTAT GTCAGTATGA TTGAGAGTGA 2 64 0 

TTCAATTTAG AT G AAAAC C A TGAAAAAATA TATTAAAACA GCATTTTTTT GTAGTATGTA 2 7 00 

TTGGTTAATT GT1CAACTAA ATATAGCAAA TTTAGGTACA AGAATTCCTG ATAAGTATTT 2 7 60 

TCGTCAGAAG TACATAATAT TTAAATCATT TAACTTTGAG AAGCATGGAA AATTTTGGAA 28 20 

CAAATGGTTT TACGTAAGAA AATGGAAACA TAAGATTTTA GATGGTCATC AGCTTAATCA 28 80 

AAATATATAT GATCAGCGTC ATTTAATGAC AATCAATACT GATGAAATTG AAAAAATGAT 294 0 

TATAGAGACA AAGAGGGCAG AGTTGATTCA TTGGATATCG ATACTTCCAG T CAT CAT AT T 3000 

CAATAAAGGC CCTCGTTTAG TAAAGTATAT AAATATTTTC TATGCAATGA TAGCTAATGT 30 60 

TCCAATCATT ATTGTGCAAC GCTATAATCG ACCGAGATTA ACGCAGTTAC TACGCATATT 3120 

AAAACGAAGA GGTGAACGTC ATGACTAAAC ATATCATCGT TATTGGTGGT GGCTTAGGTG 318 0 

GGATTTCTGC AGCAATTCGA ATGGCACAAA GTGGCTATTC GGTCTCATTA TATGAACAAA 32 4 0 

ATAATCATAT AGGAGGCAAA GTGAATCGTC ATGAATCAGA TGGCTTTGGC TTTGATTTAG 3300 

GTCCATCTAT TTTAACGATG CCTTATATTT TTGAAAAATT ATTCGAATAT AGCAAGAAGC 3360 

AAATGTCAGA CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC TTTTTTCCAG 34 20 

ATGGAACGAC TATCGATTTG TATGAAGGTA TTAAAGAAAC AGGTCAGCAT AATGCGATAT 34 80 

TGTCGAAACA GGATATAGAG GAACTGCAAA ATTATTTGAA TTATAEAGA CGAATCGATC 3 54 0 

GTATTACTGA AAAAGGGTAT TTCAACTATG G T T TAG AT AC ACTATCTCAA ATTATTAAAT 3 600 

TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT AC AT AC TAT G CAACAGGCCA 3 6 60 

TAGACAAGCG TATCTCGAAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 37 2 0 

TAGGTTCTTC AT CAT ACGAT GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACAAG 37 8 0 

AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 38 4 0 

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900 

AT C AAAG AC G TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGAT TATA 3 960 

TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 4 020 

GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 4 080 

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAATGCTT 414 0 

ATCTCAATTA TcAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 4200 

TAGTAAATAC GAAT AAAAC T GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 42 60 

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 4 32 0 

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 4380

TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 4440

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 4500

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGEGATC AGTAAATCCT GGTGGTGGCA 4560

TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 4620

GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4680

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4721



(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60 

T T TAG AT G AA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120 

AATTATTTAT CGTGGCGAAC GT AT TAT AAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180 

TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 24 0 

CGCTGATGGT GAAGGTTTTA TTGAAATTGC AACAACAAGA CCAGAAACGA TGTTAGGTGA 3 00 

TACAGCGATT GTTGTTAACC CTAATGACGA AC GAT AC AAA GATGTAATCG GTAAAACTGT 3 60 

TATATTACCA ATCGTAGGAC GCGAACTGCC TAT TT TAG A GAT GAG TAT G TTGATATAGA 4 20 

CTTCGGTTCT GGTGCTATGA AAGTGACACC AGC AC AT G AC CCTAATGATT TTGAAATTGG 4 80 

TCAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA T G AAC G AC AA 54 0 

AGCGGGTAAA TATGAAGGTA TGGACCGTTT TGATTGTCGT AAACAGC T AG TTAAAGATTT 60 0 

AAAAGAACAA GAT T TAG T T A TCAAGATTGA AGATCATGTT CATTCTGTAG GTCATTCAGA 6 60 

ACGATCTGGC GCTGTTGTTG AAC CAT AT TT AT C AAC AC AA TGGTTTGTGC GCATGGAAGA 7 20 

C T T AGC G AAA CGTTCATTAG AT AAC C AAAA AACAGATGAT CGTATTGATT TTTATCCGCA 7 80 

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 84 0 

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900 

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 9 60 

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGCCTG ATTTAGAAAG 102 0 

TGAAGACTTT AAAC GAT AC T AC C C AAC AAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 10 8 0 



TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATCGTC GTCCATTTAA 114 0 

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT 1200 

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 12 60 

CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAACTG AAAAAGTTGA 132 0 

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA CGTTTCAGTT TAATGAATAT 138 0 

CGGTGAAGAC TTTAAAGTTG AAGATAXGA TTTAAGTGGT AACTTATCAT TAGCAGATAA 14 4 0 

ATGGATTCTA AC AC G T T T AA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500 

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 15 60 

GTACATTGAA AT GAG T AAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AfiAAGTTAC 162 0 

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680 

ATTTGTAACA GAGAAAATAT GGCAAAGTTT AC C AC AT G AA GGTGACACAA TTGTTAAAGC 174 0 

TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AG T AAAC AAA CAATGCAACA 1800 

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACGTGTA GAAGTAAATA CGCCATTGTC 18 60 

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATT G AAAC AA CTTTATCACA 1920 

AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACGTGG 198 0 

AAAwTCCTGA GAAAGCAATG ACaTCAGTTG TAATGCAGG TAAAGTGGTA T T AC CAT TAG 2 04 0 

AAGGGCTAAT TGATATGGAT AAGGAAATCA GCCGTTTGGA AAAAGAATTA GCTAAACTTC 2100 

AAAGCGAATT AGATAGAGTA GATAAAAAGC TCTCTAATGA AAACTTTGTA AG T AAAG C AC 2160 

CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA TTATCAAGAA AAATATGATG 222 0 

GTGTGAAGGC AAGAATTGAA CAATTAAAAG CATAGGAGTT AGTAACAATG AATTACCTAG 2 28 0 

AGAGCTTGTA TTGGATACAC G AAAG AAC T A AATTTGGCAT C AAAC C AG G T GTTAAACGTA 234 0 

TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA CATTAAGGGT ATT CAT G TAG 2 4 00 

GTGGCACAAA TGGTAAA3GC TCTACAGTTG CTTACCTTAG AACAGCTTTA GTTGAAAATG 2 4 60 

GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC AT T T AATGAA CGAATTAGTC 2520 

TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT AGTATCACGT AT T AAACC AG 2 580 

TAAGTGAAAT GATGGAACGT GAAAC AG AT T TAGGTGTTGC ACTGAATTC GAAATAATCA 2 64 0 

CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT TGATTTTGTC ATTGTTGAGG 2 7 00 

CTGGATTGGG TATAAAGAAC GATTCGACAA ATGTCTTTAC ACCGGTTTTA TCAATCTTAA 2 7 60 

CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC TTATCTAGAT ATTGCTAGGG 2820

ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA TGCTGTTAAA AATGAAGATG 2 880 

CATTAAAATA TGTTCGTGAA CGCGCAATTG AACAACATGC AAAGCCAATT GAATTAGATA 2940

GAGAAATTGT TGTTGTATCG CAAAATGATG AAT T TACT T A CCGTTATAAA GATTATGAAT 3000 

TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA ACAAAATGCT GCATTAGCCA 30 60 

TAACAGCTCT TATTGAATTA AATGAACAAG GATTAATTGA ATTAGATTTC AAT AAGAT G A 3120 

TAGACGGTAT T G AAT C AG T T CGTTGGACTG GACGTATTGA GCAGGTGCAT GACAAACCTT 3180 

T AAT CAT T T T GGATGGCGCA CATAATTCAG AGAGTATAGA TGCTCTAAT GATACAATTA 324 0 

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC AGCAATAAAC GGAAAACCGA 3300 

TTAACGAGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC GTTTTATGCA ACTGAATTTG 3360 

ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG TATTGAAAAT GATGAAATAC 34 2 0 

AATTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA AGGT GAT AC A TTAGTAATTA 34 8 0 

CCGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 3516 
(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

TGAGTGATAG AATCAAAAAA GCCATCTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 60 

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACGTA ACTATGATTT AATTTAAGCA 120 

TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAAT T AATG AG AAT T AAAT TTAATAATGT 180 

AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 24 0 

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC fiTATAGTCT 300 

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT AC GAG AT G AA 3 60 

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACGTCTT ATCACAATAA GTATGTTGAG 420 

CGCGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 480 

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 54 0 

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600 

AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 6 60 

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCGT 720 

TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT ACGGTATGAT 780 

ATTTAACTTA GCTGATATCG C AAAT AAT C T TAAAGTAATC ATTGTTTCAG G AAT TAT AC C 84 0 

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGE 900

TGCGAATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 960 

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTGCTT TTTCTATAGA ACATTTGTCT 102 0 

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080 

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 114 0 

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCG TCGTCGTAAT 1200 

CATATATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 12 60 

CGATTGTATA TTTTGCCAAT GTCTTCGCCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCCGTTTTAC AG AG T GAG G G 1380 

TTAAATTTGC AACACATTAT AAAAACAGCA T T AC AAC AAA CATTTAACTA TAAAACAAAT 14 4 0 

AAAAGTATTT ACAATATCTT AGTTGGTAAG AAATCTCACC AAACCTTTTT TGACGCTTGT 1500

AGTCAACAAC AGTTGTCATT ATATCACAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 15 60 

CTATTTtTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620 

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 168 0 

TCTAATACCA AAC AAC AT G T TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 17 4 0 

GAAACTGTAA AGATTGTCTA TAACTACATT AAAGAAAACC AATTACAAAT TGATTTTGAA 1800 

AATGAACTAC ATAACTTGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT 18 60 

TATTTGCAAG GCTATGATGA ACCAATGTAT AC GAGAC AGC AAGTTAGTT AATAGAAAAG 1920 

CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAG T G AC AAT GATGTTTGAA 1980 

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 204 0 

CAAACTTATA T T AC T TAT AC AAAATTGCTT G AAC AAT T C A CGATGGAAGA TATAGCGGCT 2100 

CAACAACAAG TTAAAATCAA TACTATCGAA GAT CAT G T AC TTGAAATCTT AAT CAAAGGT 2160 

TACATGTCTA AT T AC GAT G A TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220 

TATCAACAGC ATCGTGGCGA ACGATTAAAA TTTTACAAAG AAC AAT T T G A CACGTTATCA 2280 

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG GTGATCTGAA TGTTGCATGA 2 34 0 

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCG GGACAACAGG AAATTATAGA 2400 

AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTCCA ACTGGAAGTG GAAAGAGTTT 2 4 60 

GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGACA TTAATTATCT CACCGTAAT 2520 

ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT G G AG AAAAAC GTGTAACATG 2580 

TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 2640 

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 2700 

TATGATAGAC TTTGGCATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 2 7 60 

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAGCATTTTA AAGAAGCGGT 282 0 

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA GATGATTTGA CGGAAATGTT 2880 

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATGC CCAAATATAA GCTTTAAGCA 2 94 0 

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000 

GGGACCAACG AT TAT T TAT G TCTCATCGAA AAAGATGTGT CTGAATTTAG CGCAACTTAT 30 60 

TTATGATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 3120 

CACAGTTCAA CAACAATTTT TAAATAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 318 0 

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 32 4 0 

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 3300 

AATTAGTTTA TTCCAACCG3 ACGATAAATA TATTTTAGAA ACGTTATTAT T T GC AG AT AT 3360 

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 3420 

AGCCGTTTTG ACAACGTTGc AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 3480 

GCAATCATTT AAACGAAAGC AATTAGGATT CTTTCGCATG ATTGCTATT GCAAATTGGA 3540 

TCAATGTAGA CGGAAGTATT TATTAGAATT TTTCGGTGAA TATCCACCGG CACAAGATCG 3600 

ATGTTGTGAC AATGATTCTA ATATAACTGA TATCGCAATT TTAAATAAGA AGAAGGTAAT 3660 

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT C T C AG AT AG T ATTACTTTAC 3720 

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3780 

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 38 4 0 

AG AAT AG AAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3900 

CAATCGATAG ACACAAATTC ACATCi&GAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3 9 60 

GAATTAGAAC ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC AAGAAATGCC 4 02 0 

CAAAGAAGAA AAAGACGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT T C AC AAT G AA 4 080 

T C AC AAAC AT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA C AAT AG AT GA (TGTCAAGTC 414 0 

GAAT CAT C AC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGAC AG T AC ACCTCAACAT 4 2 00 

GAAGAGGAAT ATTATAATAA GAATGCTTTT GCAATGGATA AAT C AC AT C C AGAACCAATC 4 2 60 

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA AT AAC AC T G A GCATTCAACA 4 320 

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 4 380 

GCTAACCAAT C AG AAAC AT C AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 4 4 40 

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGTGC TGGAACAGCA 4500 

GGTGTTGCAG GTGCAGCTGG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4560 

GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA 4 62 0 

GATAAGTCTA AAGAT CAT C A TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4 680 

GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTE 4 74 0 

AATAATGCAA GCCAAAACCA TGATGAACAT G AC AAT CAT G ACAGAGATAA AG AAC G T AAA 4 8 00 

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGCAG CTGTACTAAT TATCGGTGCA 4 8 60 

TTAGCGATAT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4 920 

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC AT C T AAAG AC 4 98 0 

GCTTCTAAAG ATAAATCAAA AT CT AC AG AC AGTGATAAAT C AAAAG AG G A T C AAG AC AAA 504 0 

GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA C AAT C AAG C A 510 0 

CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 5160 

GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 5220 

TACTACGGTT CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA TGGTTTAAGT 528 0 

GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 5340 

AACTGAACTG C TAT AT AC AA ACGTGCATAT TATAAGTATC TCTAAATATT T T G GAG AAC T 5400 

TAAATTGCGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTGT TATTTAACAA 5 4 60 

GGTCTTGTCT CGAATATTGG CATATCAATT TAACTTTTTA AATAGTCATC AAAAAGATAA 5520 

AACACCACAA TCAACAAATT T2&CGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG 558 0 

TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC AT T AAAC AAA TTGTCAAAAA 5 64 0 

TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 57 00 

TGGGAT TAAA GGAGGCCGAA ACAATGCAAA AAG T T G AAAG TAT CAT AST GGTGGAGGGC 57 60 

CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 582 0 

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 58 8 0 

C AAG T G AT AA ATTAAGTATT GGGcGAgTAC CGTTTATCGT T G AAG AAAG T AAAC C AAGAC 5 94 0 

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000 

CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 60 60 

AAGAT G T T T A TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 6120 

CAT TAG AAG T TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 618 0 

CGTATTTTGA T C AAG AT G T T GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 62 4 0 

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 6300 



CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAARATTG 6360 

ACATGGAATT TAATGCTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 64 2 0 

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 64 8 0 

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCGC 654 0 

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600 

TTGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660 

TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720 

CTATGTACTA AAT AAC T TAG TTTTACAACG ACT GAGS' TC ATGATATGTC AAATGAGGTT 6780 

GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 6840 

GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA C T AAT AAT T T TAATCTTGCT 6900 

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6960 

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 7 02 0 

GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 7080 

TGTCCTAACG C T T C AAT AAC CATACCATCG ATACCTTCTC G AC T AT AAAA ACTAAAAATG 714 0 

TCACCTGGCA TACCCATATA TGCTTTTACT AATGGTACAT TTAGTTTGTC AT T G AC AT T T 7200 

TCCAATGCTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 72 60 

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG TGTTTTGGTA 7320 

ACATTACGCG CCGTATGAAT TTCATCATTA Aa T ACAACCA TCAGCCTTT ATGACGGGCC 7 38 0 

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AGAACCAATT 7440 

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 7481 
(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

ATGAATTATA CCTTCCACTA GAAGTGTCGG TATTTATGTG CCTGGTGGTA AAGCAAGTTA 60 

TCCTTCAACG GTATTAATGA CAGCGACGTT AGCACAAGTA GCAGGTGTGG AAAATATTGT 120 

CGTTGTGACA CCACCTCAAC CTAATGGTGT GTCTCAAGAA GTATTAGCAG CATGCTATAT 180 

TACGCAAGTT GATCAAGTAT TTCAAGTTGG TGGTGCTCAA AGTATTGCTG CGTTAACTTA 24 0 

TGGCACAGAA ACGATACCTA ffiGTTGATAA GATTGTAGGT CCAGGCAATC AATTTGTTGC 300 

ATATGCCAAG AAATATTTAT TTGGACAAGT AGGTATTGAC CAAATAGCTG GACCAACAGA 360 

AATAGCACTT ATTATTGATG AAACAGCAGA TTTAGATGCC ATCGTATATG ATGTTTTTGC 420 

ACAAGCAGAA CAT GAT G AAT TAGCACGTAC ATATGCCATT AGTGAABTG CGCAAGTCCT 4 80 

TAAAGATTTA G AAT C AC G T A TTGCTAAAGC ATTGCCTAAT GTGGACAGAT ACGACATTGT 54 0 

TTCTAAAAGT ATCGCTAATC AACACTACCT TATCCATGCT AGTAATTTTG ATGAAGCATG 600 

CCATGTCATG AATACAATCG CGCCTGAACA TGCGTCGATT CAAACAGTAA ATCCTCAACC 6 60 

AT AT AT T G AA AAAGTGAAAT ATGTGGGTGC ATTGTTTATT G G AC AT TAT T CGCCAGAGGT 7 20 

CATAGGAGAT TACGTTGCAG GTCCAAGTCA TGTATTACCT ACAAATAGAA CAGCTAGATT 780 

TACCAATGGG TTATCGGTCA ATGATTTCTT AACACGGAAC ACGGTCATCC ATTTATCAAA 84 0 

AGATACGTTT GAACAAATTG CTGATTCACE ACAACATATT GCTCATGTTG AAGCATTATA 900 

CAATCACCAG CAGTCTATTT T AAT AC G T C A GTCTTAGGGG AGTGTAATTG AAATGATTTA 9 60 

TATTGATAAA AATGAAAGTC CAGTTACGCC GTTGGATGAA AAAAC AAT GA CGTCTATTAT 1020 

TAGTGCAACg CnATATAATT TATATCCTGA TGCAGCATAT GAACAATTCA AGGAGCTTA 1080 

TGCTAAGTTT TACGGATTAT CGCCTGAACA AATTATTGCA GGAAATGGAT CTGATGAATT 114 0 

GATTCAAAAG TTAATGCTGA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTGATTT 12 00 

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG 12 60 

ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACGAAA ATCGATGAAG T AC AAC CAT C 1320 

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGGCATTTTT 138 0 

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 144 0 

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGG(ACCA CACATCTTAA GAATGCGTAC 1500 

AT TAT C AAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 1560 

AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 162 0 

TATTGCGACT TATATTTTTA GACATAGAGA AGAGACAAGA CAATTTTTAA CGATGCAACG 168 0 

ACAGTTAGCT GAGCAGTTAA AACAAATATT T GAT AC AC AT GTTGCAGATA AAATGTCAGT 17 4 0 

GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGCAAC AAT T AGGACA 1800 

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA 18 60 

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA T T AG AAG AAA TTGTTAAAGA 1920 

ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 198 0 

TTTCAATATC T GAT GAT C AG TCACCATCGC AT AT T AAT AC AGGTGTGGGC TTTTTAAATC 204 0 

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAEATTGAG GCACAAGGTG 2100 

ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCGG CATTGTCATT GGCCAATTGT 2160 

TACTTGAAAT GAT T AAAG AT AAAAAGCATT TCGTTCGTTA TGGAACGATG TACATTCCAA 222 0 

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 2280 

ATGCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 2 34 0 

TTAGAGCGGT CGTAATCAAT GCAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 2 4 00 

ATACACACCA TGAAATTGAA GCTATATTCA AAGCGTTTTC CCGTGCATTA GGCATAGCGC 24 60 

TAACTGCAAC T GAT GAT C AG CGTGT3CCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 2520 

CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 2580 

TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 2 64 0 

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATCTCAATGCAAT 2 7 00 

ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 2 7 60 

TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2820 

TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 28 80 

TATGTTAAAT CAAGATGTTT ATTTCGTACA TTCTTATCAA GCGCCGATGT CAGAAAATGT 2 94 0 

AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3000 

TGGTATTCAA TTCCATCCTG AAAAAAGCGG TACATATGGG TTACAAATTT TGCGTCAGGC 3060 

AATACAAGGG GGATT TATAA AT GAT T GAAT TiTGGCCAGC GATTGATTTG ATTGGGTCAA 3120 

CAAGTGTGAG G T T AAC AG AG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 318 0 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 324 0 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAST 3300 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGTCACAA ATCATGGACT 3360 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 3420 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 34 8 0 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 354 0 

TGAGACGGTT ATCGGATATA CCTCTTGGCG GC AT TAT AT A TACTGATATT GCTAAAGATG 3600 

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGACGATTC 3660 

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 3720 

ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 37 80 

AATGATTAAA AAACGTATCA TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 38 4 0 

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900 

TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3960 

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4020 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG C AG AT AAAG T 4 080 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 414 0 

CGGTAGACAA TGCATCTGCA TAGCAAT TGA TAGCTATTAT GATCCTGAAA G AAAAG C AC A 4 200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4 2 60 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGAC ATGATGGTAT 4 320 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4 38 0 

CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 4 440 

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4 50 0 

TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4 5 60 

AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4 62 0 

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTGTGGT ATGTTTCTAT 4 68 0 

TCGCGGTCGA AACAACGGCT AT GG AC AASi GGTGAAACAT CTGGTCATAC GCAACGTGTT 4 74 0 

AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATGTCAT ACCAAATGGA 4 800 

CCAACATGTC ATACAGGCAG TCAAAGTTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 4 8 60 

ACATTAGCGC AG AC AG T T C A AGATAGTGCC CAATCCAATA AT G AAAAG T C ATAffiCAAAA 4 92 0 

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 4 98 0 

GTCGTAATTG AAGCAATTAA AG G T G AC AAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 504 0 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 5100 

GAATTAGCGC GTAGACATCA TAAGCGCAAT AAC T T T AAAG GTGAACGACA AAATATCGAA 5160 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT AT AC AC TT AC 522 0 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC G AC AC C AAC A TATTTTAACT 528 0 

ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCGTATA GATAAGTAGT GGATAATCAC 534 0 

AATACGAAAT CAAAAATAAT T AT AAAAAG T AAATTGAGCA ACTCAGGAAT AGATGTCACT 5 4 00 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 54 60 

G AAAAAGC T A AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 5520 

TTCTTTGTAT T TAT AC TATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 5 58 0 

TATTT AAAAG ACAATCTCCA TTAGAACTAA GATATGCATC CCGAAAGTTA GACTAAAAAA 5 64 0 

CTAACTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 5 7 00 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 5760 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 5820 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 58 8 0 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TAfiTCATTG GAATTAAGCC 5 94 0 

CCAAGCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTtGCATA 6000 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 60 60 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 612 0 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 618 0 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 62 4 0 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 6300 

GTAGGTGCAT CTTCCATTTT AATACETACA TTACTATTTT TAAGTG 6346 
(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTAGTCAAT GAATAAAGTA 60 

ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 120 

TCAAATAAAC AAT CAT C T G A TAACAAAGAT AAGGAAACAA CTTCAfflTAA ACATGCAATG 180 

GGtACAACTG AAATTAAAGG GAAACCAAAG CGTGTTGTTA CGCTATATCA AGGTGCCACT 24 0 

GACGTCGCTG TATCTTTAGG TGTTAAACCT GTAGGTGCTG TAGAATCATG GACACAAAAA 300 

CCGAAATTCG AATACATAAA AAATGATTTA AAAG AT AC T A AGATTGTAGG TCAAGAACCT 360 

GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATTGTCGC GTCAAAAGTT 4 20 

AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATCGCAC CAACAGTTTC TACTGATACA 4 80 

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA AGAAAAAGAA 54 0 

GCTGAAGATT TACTTAAAAA GTACGATGVT AAAGTAGCTG CATTCCAAAA AGATGCAAAA 600 

GC AAAG TATA AAGATGCATG GCCATTGAAA GCTTCAGTTG TTAACTTCCG TGCTGATCAT 660 

ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA ATGATTTAGG ATTCAAACGT 720 

AAT AAAG AC T TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT TASTCTAAA 780 

GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTGTAG TAAAATCAGA TCCAAATGCG 840 

AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGGA CTTCAAGTAA AGAGTGGAAA 900 

AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 960 

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA AAAGTTAAAT 1020 

ATTGAAAAAC AAT C AAAAT A ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 108 0 

ATCGTTATTG CTGGTTTATG T C T T GC AAT A GTAGCTATCT TAAGTTTAAT GATTGGAAAT 114 0 

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGT'ATTCA ACTTTGATAG TGAAAACGAT 1200 

TTACATGATG TTGTCACTGG TGCACGGGCG TCGAGAACAA TCATTGCGTT ATTGACTGGT 1260 

GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 1320 

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 138 0 

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 14 4 0 

ACTGTATTAG TTGTTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 1500 

ATTTTGGCAG GTGCTGCGAT TGCGATGCTA TTTACAGCCT TTACTCAAGG CATACTTATT 1560 

ATGAACGAAA C AG AC T TAC A AGGCCTATTA TTTTGGTTAA GTGGCTCCGT TTCATTACGT 162 0 

AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTTT AATTGCATTT 168 0 

AGCATGGCTG C AC AC AT C AA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 174 0 

CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TfiGTATGTT AGCCGGTATT 1800 

TCGGTAGCCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 1860 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 192 0 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 198 0 

GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT AT AT TAT GAA GAAAGGACGT 204 0 

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 2100 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGAT GATTGG TTCAAGCTTT AT ACC AT T AC 2160 

AACGCGTACT GATGTACTTT ATASVTCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 2220 

TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 2280 

GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 234 0 

CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT T TTCAGCCATT T AAC AATAC 24 00 

ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 24 60 

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCGATGCAAA 2 520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 2580 

AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 2 64 0 

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2700 

TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 2760 

TAATCCAATT AATCACTTCT ACTATACTCG IVTCTATGGC AATCAGTTTA GTAGGTAACA 2 82 0 

TTGGGTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2880 

CTAAAAAGTT ACTAATGTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACTTAA 2 94 0 

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTG 3000 

GTGCCCCATT C T T CAT AT AC TTATTATTAA CCGTGAAAAA GTTATAACGA TAT TAT T AAA 3060 

ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGTT 3120 

TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TTACATATTA 318 0 

ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 32 4 0 

TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 3300 

AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3360 

TTCTTATTAT ATTGTGTTAT ATTAAAATAA AGTGCATAA CAAGGAGATA AAACAATGAA 34 2 0 

ATATAATAAC CAT G AC AAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtnTCGTTT 34 80 

TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 35 4 0 

TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 3 600 

TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG AAACATTCAT ATATTAGTAA 3660 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 372 0 

nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3775 
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

AGAAAAATTA GCCTAGCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT TGTCGTAGGA 60 

CTTTTATTAT ATGGACAAGA AAACGTAGCA AAT T AC AT T A AACCATTTGG TGACGTATTT 12 0 

TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC GCTTTCTATT 180 

TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GaAAACAAT TTTATACTTT 24 0 

GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT ATTTAAACCA 300 

GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA TCAATCAACT 3 60 

GCACATGCAG CAGAACAATC TACATATGGa AATCATTTTA TTGATACCAT TGTACATATT 420 

ATTCCGACAA ACTTTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT TATCTTCTTC 480 

GCAGTATTCT TTGGATTAGG ATTAGCTGCT GTAGGTAAAA AAGCAGAACC AGTTAAAGAA 54 0 

TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT AAAATTAGCA 600 

CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC ATCCGCATTA 660 

T T AC C AC TAT T AAAAT TAG T ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT TGTATTCGCT 72 0 

AT AC T AGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT TAGAATCTTG 78 0 

AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTAtC TCCTGTAATG 84 0 

ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT TATACCAATT 900 

GGTTATACGT T T AAC T TAG A TGGATCAGCA CTTTATCAAT CTATTGCAGC ATTATTCGTT 960 

GCACAGATGT ATGGAATGCA CTTAACATTA T C AG A G C AAA TTGTGTTGAT . GTTAACATTA 1020 

ATGATTACGT CTAAAGGTAT GGCGGCAGTA CCAGGAACTT CAATCGTtGT TTTATTAACA 1080 

ACATTAGGTG CCATGGGCTT ACCGGCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 114 0 

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 1200 

ATAGCTAAAT GGGAAAACGT ATATGACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 12 60 

TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 132 0 

TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 1361 
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

TCAATATGTA AATCGGCGAT TGTCCCTATC tTCATACCCT TACCTTCTTT STCAAATTA 60 

AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 12 0 

GATTATTTTA TATTTTTGCA TAAAAACTTA TATCTTTTCA AAAAAATCGA TAAGTTTTAG 18 0 

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 24 0 

AAATGGATTT CTTTAGTATT TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 300 

TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAG AG T T G AA TAACAGACAC 3 60 

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 420 

ACAATATCTT TAACTGGTCC ATCACTGTTA TT (AC AT AC A TGATTATTGG GGTTGTACTA 480 

TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 540 

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 600 

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGE 6 60 

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 720 

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780 

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 84 0 

AAAAC AC AT T ATGGACATGC AT CAT TC AC A AACTTAATCA GTCACGGTGG CATGTTCCCT 900 

GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 9 60 

GAAC T TAT AG GTGTAaCTGC TGGTGAAaCG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 102 0 

ATTAATAATG TACCTATCCG TATTTTATTA TTCTATATCGGTGGTCTATT AGTAATTATG 1080 

TCAGTCATAC CTTGGAATGA TATCGATCCA AATAGTAGCC CTTTCGTTAA ACTCTTTACA 114 0 

TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 1200 

TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 1260 

GGGTTAGGTC CTAAAGTTTT AAAT AAAAC G AATAGTCATG GCGTGCCTTA TTTATCAATG 1320 

TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 1380 

ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGGGCA 1440 

ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 1489 
(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5000 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC C AC C AAAT C T 60 

TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 120 

TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TTJ5CTCCAT ATCCTTTtAA 180 

TCCAkGACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT GCCATCCATA 24 0 

ACTACCGTCG AAATCCCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 300 

ATTACTATTA CTTTTATTAT TTGAAGACAC TGTTTTTGGT TTTGGTTCTA CTAATGGTGT 3 60 

CATTGGCACT TTTAATTTTT GACCAATAAA TATTAAATTA GGATTTGCTA TATTATTTGT 4 20 

ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA GTGTGTCTCC 480 

TTTTTTTACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 540 

ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTGCAAT 600 

TTTAATTTCT TTAGTAAGTT CATTGTTATT AAT AAC TAG A TTGTTACCTT GGCTTGAGTT 660 

TTTCGGAGTA TTTGAAATCT TTATATCTTG AT T AAT T T C A TTTCCGTTTG AAATTGCTGA 720 

TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCATTTTTTATTTT 7 80 

GTCTTTTGTC GTATTTTTAT TAGCATTTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 8 40 

CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 900 

CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA ATACTAACTT TTCTAATTGA 9 60 

ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 102 0 

CATTATAACA TAGTGCGCTA TATATTTTAA TTAAAATTTT CTTTGTTTTA TAAAGCTATT 1080 

GTAGGAGGTA AAATAATTAA CTTGTCTTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 1140 

GCACGACAAA TAGCCCATTT CCGATACTTT TAAAAGTAT GGAaTGGGCT ATAGCCATTT 1200 

ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCTTCTT 12 60 

TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 1320 

AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT AC C AC C T G AA ATGATTGHC 1380 

CAGCATGTCC CATACGTTTT CCAGGAGGTG CTGTTTGTCC AC C G AT AAAG CCTACAACTG 14 4 0 

GTTTTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 1500 

CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAAC AT C AA 1560 

TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AGCTGTAGTT TGACCAATAC 1620 

CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 168 0 

CAACATGACC TTTTTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 17 4 0 

TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 1800 

T AAC T T T AAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT 18 60 

CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 1920 

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTGAAAACA GGAACGCCTT 1980 

CAACAACTTG ACCACCTTTA CCAGGCGTCA CACCTGCTAC TATTTTCGTA CCATAAT CAA 204 0 

GCATTTGTTT TGTATGGAAA AGGGCAGTAG ACCCTGTAAT ACCTTGTACC ATTACTTTAG 2100 

TATTCTTATC TATAAATACA CTCATCTTAG TGCTCCCATC CTTTCCTTAT GCTTCTTTGA 2160 

CTAGTTTAAC AATTTTTTGT G3ACCTTCAG CCATTGTTGC TGCTGGTTCA ATTGCTAATC 2220 

CTGAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 2280 

GTGGTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCG ATAACATCAC 2340 

ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTCA TCACCTAAAA 2 4 00 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCCGCCTGCA TCTAAGAAAT 24 60 

TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2520 

CAT T AAC CAT AC AT C C GAT G TCACCATCTA ATGCAATGTA TGATAAATCA TGTTTAGACG 258 0 

CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCACGTAA TTCTACAACA TCTTTATGTC 2 64 0 

TGAATAATGC AT TAT CATC A AAATTAATTT TAGCATCTAA TGCCAATACA TCACCATCAG 27 00 

CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATG AAT AC AT TAT 27 60 

AAAGTGCTAA TAAGAATTTA GCAGCTTTGT T AAC AG AT T C TTTAGGAATA TTAATATTAA 2820 

ACGCAATTCG TCTTGCTTGG AATGGTGAAA GTCCGATTAC TGGATCGATA GTTTCTTTGA 2880 

AGATCTTTTC AGGAGTCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 294 0 

CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATOTTTTT 3000 

G AAT AG C AC A ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 30 60 

GATGTGTCAC CAAAGTTTTC CCTAATAATT CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120 

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TGTGCTTTTA 3180 

CAACATAAAC ATCAGAATTT AATTCTTTTG CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 32 4 0 

CTACTCGTCC TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 3300 

CGTGGATATT CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 3360 

AATGTAAGCG CTATTGTAAA CTTAAATGCT ACTTTTTTAT CAT T T AAT T G AATTTTACGA 34 2 0 

TTTACAGTAA CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 34 80 

CATCGATTGC TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAACCATATT 354 0 

CAGGATAATC T T TAG AT AAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 3600 

TTGCAGCTGC AATGGACACA CTTCTTGCAT CACCCTTGAT TAAAGATACT TGAGGCAGTG 3660 

CATTATCAAG CGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 3720 

TAGCTCGCTG CATGGCGATT TGAGTAGCTT TATAAATATT AAATTCATCT ATTTCTTnCA 37 80 

GtGTCGCGAT CCCATATGCA AAAGCAGTAA CTTCATTTTT TAGTGCTTCA TTTAATTCTA 38 4 0 

GACGTTTCGT AACAGGTACT TTTTTCGAGT CATCAAGGCC CAAATAATTG T GAT T T G AAT 3900 

TTAAAATTGT TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA AC T T C AT C AA 3960 

TCCCACAAAT AATAGC AT T A GGATGCTCTT TTAATATTTC ATTTCAAAG TAAGTCATTT 4 020 

CAACATACTT TTCTTTTAAA GCTTGTTCTT TTTCTAACGC TTTTCTGCGC CTAGCTATGG 4080 

CATTTTGAAC ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 4140 

TATTAACCGC ATTAATCAAC TGCGTAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 4200 

TCAGTCATAT CTTTAAAAAT ATCAAAACAA TAATTTCCTA TTTTAGCATT TCGAATATCA 4 2 60 

TAAATAATCA GTTCAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCc 4 32 0 

ACGTTTTTTC CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 4380 

ATTATAATGT GACTTTAATC GCGCTAAATC ATTTTGAATT AAAAAGT T T A AT CC AT AGAT 44 40 

GGCAACTTCA TCTAAGTGCA CAATACTATC TTTTATCGCA CCAGTTAAAC TCAACTTCTT 4500 

ACCGACTTCT TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 4560 

TGCATTACCA ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTATTACCAGTCTG 4 620 

CGCAATACTA CGCTTTGCCA GTTTATTTAT TAATGTGGAT TTACCAACAT TTGGAATTCC 4 68 0 

AACGATCATT GCTCTTATCG CTCTAGGTTT AAGTCCTTTC GCTTTTTCGC GTTCAAATTT 47 4 0 

TTCAGCAGTC GCCTTAATTG CTGCAGCTTC CACTTTCTTT AAATTTTTAC CGTGCTTAGC 4 800 

ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT tGTTCCCATT TTGACATCTC 4 8 60 

ATTTAAATTA GACATATCTT TTTTATTTAA TATAACAACA CGTGGTTTTT GGTTAATAAC 4 920 

TTCATCTATC ATAGGGTTTC TTGAACTATA TGGAATTCTT GCATCTACTA GTTCAAACAC 4 980 

TACATCTACT TTTTTTAATT 5000 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

AACCCGnAGA CGAAACTnCA TAGTTGCCTG TTATGTTGAT TAGTGCGTTT GTAGCTATTT 60 

nAAATCAAAC ATT ATT AAA T ACAGCGTTaC CTAGTATAAT GAGAGAATTA AATATCAATG 12 0 

AAAGTACATC GCAATGGCTA GTTACTGGGT TTATGCTTGT TAATGGCGTC ATfiTACCTC 18 0 

TGACGGCATA TCTAATGGAT AGAATTAAAA CTAGACCTTT ATACTTAGCG GCGATGGGGA 24 0 

CATTTTTATT AGGTTCTATT GTTGCAGCCT TAGCTCCGAA TTTTGGAGTT TTAATGTTAG 300 

CTCGTGTAAT TCAAGCGATG GGTGCAGGCG TACTTATGCC CTTAATGCAA TTTACGTTAT 3 60 

TTACATTGTT CAGTAAAGAA CATCGAGGTy TtGCAATGGG ACTAGCAGGT TTAGTAATTC 4 20 

AATTTGCACC AGCAATAGGA CCTACAGTTA CAGGATTAAT TATTGATCAA GCGAGTTGGC 4 80 

GAGTTCCATT TATTATAATT GTAGGAATTG CTATACTTGC CTTTGTTTTC GGTTTGGTTT 540 

CAATCTCGAG TTACAATGAA GTGAAATATA CG AAA* TAG A TAAGCGTTCA GTAATGTATT 600 

CAACTATTGG GTTCGGGTTA ATGCTATACG CATTTAGTAG TGCAGGAGAT TTAGGATTTA 660 

CAAGTCCAAT AGTAATAGGT GCGTTGATAT TAAGTATGGT TATTATCTAT TTATTTATAC 720 

GTAGACAATT TAATATTACT AATGCACTTT TAAATTTAAG GGTTTTTAAA AATAGAACAT 7 80 

TTGCATTATG TACGATTAGT TCAATGATTA TAATGATGTC AATGGTTGGA CCTGCGCTGC 84 0 

TTATACCGCT ATATGTTCAA AACAGTTTAT CTTTATCTGC CTTGTTATCA GGACTTGTTA 900 

TCATGCCTGG TGCAATAATA AATGGTATTA TGTCAGTTTT TACAGGTAAA TTTTATGATA 960

AGTATGGTCC TAGACCATTG ATTTATACTG GTTTTACAAT TTTAACAATT ACTACAATTA 1020

TGTTGTGTTT CTTGCATACA GACACATCTT ATACGTATTT AATAGTCGTA TATGCAATTA 1080 

GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACAGGAATT AATTCTTTGA 114 0 

GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGTCGTGTA ATGGCTGGTT 1200 

CACTAGGCAC AGCTTTAATG GTTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 12 60 

CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA ATCCATTGCT ATAGGGGTGG 1320 

ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 1380 

TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 14 4 0 

GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 1500

AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA GATGTCTTAA AATGATTAAG 15 60 

GGGGATTTGC TTTGTTAACG GTAGVTCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 1620

CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 168 0 

AAnAAGAACA TGTGAGTGTT 1700
(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10146 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTATTCAA CACTTTTCAA 60 

ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC AGTTACAATT 120

ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCCG TGCATGTATT AGCTTTTTTA 180 

ACTAAGCATC aTTCAGAAAA ATTCAATAGT AGTTCATTAG CAGAaTTAaC TTGTTTAAAT 24 0 

CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT TGACACAATA 300

CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC TCTAGCAACA 360

TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC TGGCGACGAA 4 20 

GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA TCAGCAAGAC 4 80 

GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT CATTGAAGAC 54 0 

ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG GATTTGGGAA 600 

AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG EGTTATCGA • 660 

ACAATCTCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC CTTCGAAGAC 7 20 

ACTTGTACAT GATGGATTAG AAGGCAAATC TTTTGAAGCA AGTTATAACC GTAAAAACGA 7 80 

TGTTGTCAAT GCGCTAAACA ATAAAAATTA CCACTTATTA GCAGACGATA ACAACATTGA 840

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 9 00 

TGACGATATC GTCGATAGTA TTACTGCACC TCATATCATT ATTAATACCG GTGCTACCTC 9 60 

TGTCATTCCT AACATTAAAG GCCTTGATCA AGCTAAACAC GTCTTCGATT CGACAGGTTT 1020 

ATTAAACATT AGCTATCAAC CTAAGCACCT CGTATTGTA GGTGGCGGTT ATATCGCCTT 108 0 

AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 1140

ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 12 00 

TAAAGGCATT GCATTGCATA CAAATGTTGA AAC G AC T G AA TTGTCATCTG ACAATCATK 12 60 

TACAACAGTC CAT AC C AAC G TTGGTAACTT TGAGGCTGAT GCAGTACTTT TGGCTATCGG 1320 

GCGCAAACCG AATACGGATT TAGCTTTAGA AAA T AC T G AT ATCGAATTAG GCGACAGAGG 1380 

CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCG CATATTTATG CTGCAGGTGA 14 4 0 

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GAT TAT C G AA TTATCAAATC 1500 

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 15 60 

ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 1620 

TTATGATTAC AC T G AAC AT C AACTTTTAGT AAGTGCTATACCTCGTCATA AAATTAACAA 1680 

TGATCCAAGA GGTTTATTTA AAGTAGTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 17 4 0 

TACATTATAT GGTAAGCAAT CTGAAGAATT AAT TAATAT A AT TAAAC T.T G CGATTGATCA 18 00 

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACGA TGGCCGAATC 1860

ATTTAATGAT TTATTTAATT T C TAG AC AAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 1920 

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 1980 

TATCTTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 2040 

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 2100

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 2160

AATAAGATTG ACAACCTATC ACGATTTCAG CATCGGGATC TAATTTTTCC AACTTCTTTT 2220 

GTAACGTTCT TATATTGACT GCCTGACATT CAT C AC AAAT AAGGAATEA TTTTTCATAT 2280 

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 2340 

CTATCTATTT TACTCTTTTT AATATTTTTT TCAAGATACG TAACACGGCT GaCAATAAAA 24 00 

AATGGAGCAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCCTATCAAA TCATTTATTG 24 60 

CCCATGATAA ATATTAAATT TTAATGGTTT AATACCATGT TTTGTCCATT CAT CAT AAAT 2 520 

TTTTTCTTTA TCTACATCTT TAT TGATAAT TGTAATACCA CAGTCTCCAC CACCAGCGCC 2580 

TGATGTTTTA GATGCGCCGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 2 64 0 

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 2700 

CACCATCTTT TGCACACCTT TAATGTTATT TGTTTTAAAA GCATGAATAA GTTTTTCAAC 27 60 

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 2 820 

CACTTCGCTA ACAAAGTGTG GTGATGACGC CGGTGAGCCA GTCCAACCGA TAAGSCTTC 28 8 0 

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTTTT TGATTAAAAC 2 94 0 

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAG T AC TATA 3000 

CGCTAGCCAT CC ACT AT AT A CACTCACAGC AATATCTCCG CATGAACTTA AACTTTGTAA 3060 

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA T TAG A T AAC T T CAT AT CAT A 3120 

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180 

TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AG AT G AAAAT GCTTCATCGC 32 4 0 

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTTC&TA GCTGTGaCCA CATAATTTAA 3300 

TTGTTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 3360 

TGGGTTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCTG CTTCTTCAAT 34 20 

AGTAGCAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG GTTCTGTTAC 34 8 0 

AGCATATTCT CCAGCAATAT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 354 0 

TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 3600 

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTGCTTG TTTTTCTTTT CT AC AAG TAT 3660 

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGGCATTC 37 2 0 

GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTTGA 37 8 0 

TCCTAGATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 38 4 0 

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAGCTAAA TCTEATCAA TATGATCTAA 3900 

CCAATATTGA TAAAACCTTG ATGTGTTTCG TGTCAATGAC ATACCATATC GACTAGGTAC 3960

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4020

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4 080 

TCCACCATAA ATACTACGCG ACGCAGAACC CGAACCAATT CGCGCCAATC TCGATAAATC 414 0 

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4200 

GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4 2 60 

ATACCAATCG ATGCCAGCTC TATTTCTGAC AATATCCATA TATTTTGAAA TTTTCTCTAA 4 32 0 

TTCTTTGCCA CTAACCTTTT CACCATTCAA CCAAAATTGA TCCTGTGTTA ACTGGTCGTT 4 38 0 

AAAAG T G AC T TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATTCAT 4 440 

TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTGCAA STTCGTATG 4 500 

TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 4 5 60 

GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCTTTGG 4 620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4 680 

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 474 0 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 4 800 

CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TACTGAGGGT 4 8 60 

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTSTCTTGT TGAACCTTTC ACACCAGTAT 4 92 0 

CTATAACAAC CATATAGCCG TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTTT 4 98 0 

GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 504 0 

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTC 5100 

CTAAAAAATC ATAACTTGCA C G AAC AAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 5160 

CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 5220 

AT TC T AC AAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACCAT 5280 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTT 534 0 

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 5400 

CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTTGTC ATGTCAATAT 54 60 

CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTATTTATTAGTA AAAG AC AT C A 5 520 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 5580 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AAC T T GAT T A GTATAAAACT 5 64 0 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGAGTGAT ATTTCAGATT CTTTATCTTT 5700

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 5760

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820

CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAAC AT G AAA 58 8 0 

TTTTCTTATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGTTCAAT 5940 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6000 

CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060 

TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCET TCTTCTTGAG 6120 

TAATTGGCGT CATATTTAAT ACACCGCCAA GATCTTTGAT TGCATATAAT AG T AAAAAC A 618 0 

TTACATCTTG AACAGTAATC GTTTTGTTAA GCAATGTCTC TAATGAGGCC AT AC AAGAT G 624 0 

GTTCAATTTC AGGATATTTA AATTTAGTCA CTTCACCTTT TAAAGCATGT TCATAAAATG 630 0 

TTTGCATCAT CAATGCACGT TCTGAACCAG AGCCTTCAAC ACAAAGATAA ATTTGTACAG 6360 

CAATACCGCC TCTAACTCTT CGTTGCGATA TGCCTGCAAA TTTCTTACCA TCGATACTTA 64 2 0 

AGTCAAATTT TCCTGGGCAA TAAGAATGTT CAATTTCCAT CGTATCAATA TCAACATTCT 64 8 0 

CATTTTCGAA CATTTTGCTA ATTAAGAGGT ACATCACAGT AAACGCTTCA TCAATCGTTG 6540 

TTTCTGTTTG TCCTTTGAAC ATCAGCGATA TATTTAATAC ACCTTGATCT AGAACGACAC 660 0 

CTAAGCCACC AGAATTTCTA ACAATGGCAT TATAACCAAT CTCATTCGTT AAATAATCAA 6 660 

TGCCATCTTT TAAAAACGGC AATCTTGAAT CAT G AAT AC C AAGAATAACA GTATETGAT 67 20 

GAATCCAAGT ACGCACAACA TTATCTGATA TATCTTTGCC CACACTTTCG CAAAATGTAT 67 8 0 

CATCGAATGC GAAAGATTGC ATAGGTTCTA AT C C AG AAG A ATGATCGATA TATCGCCAGT 684 0 

TGACGCCATT AAAAT AT T T A CTCGCTAAAT CCATCGTTAT TGTAAGGCTT GCGCTGCTGT 690 0 

AATAATTGAA AGATTGTATA CATCTTCAAT TGAGCAGCCA CGTGATAAGT CATTTACTGG 6 960 

AGAAT T T AAA CCTTGTAATA CTGGACCAAC TGCATCATAT CCACCTAAAC GTTGTGCAAT 7 02 0 

TTTGTAACCA ATATTACCAG CTTCTAAACT TGGGAATACA AAGACATTTG CATCACCTTG 7 080 

TAATTTAGCA CCTGGCGCTT TTTTCTCAGC AACACCEGT ACAATCGCAG CATCAAATTG 714 0 

GAATTCGCCA TCAATGATTG CTTCTAATTT TtCTTCTTCA GCTTTTtGTT GTGCTAATTT 7200 

GACAGCTTCT TGAACTTTTG TCACGTCGTC TGATTTAGCA GACCCTTTTG TTGAAAAGCT 7 2 60 

TAACATTGCA ACTTTTGGAT CCATGCCAAA GCTTAATGCT GATTTTGCAC TTTCTACTGC 7 320 

AATTTCTGCA AGTCCTTGTG AATCAAGTTC TGGATTGATT G C AC AAT C AC CAAAGATGTA 7380 

TTGTTCATCA CCTTTAATCA TAAAGAAGAT ACCTGATGTT C T T GAT AC AC CTGGTTTCGT 7 440 

TTTGATGATT TGTAAAGCTG GACGCACAGT GTCGCCTGTT GAATGTGCTG CACCACTAAC 7 500 

TAAACCATCT GCTTTACCA3 CATAAACAAG CATTGTACCG AAGTAGTTCA CATTGTTTAA 7 5 60 

TAATTCTTGT GCTTGTTCTT CAGTCGCTTT ACCTTTACGT CGTTCAACAA ATGATTGAAC 7620

TAATTCAGCT TTCAATTCAC TTGTCGCAGG ATTAATTAAT TCAATATTAG AAATATCAAG 7680

ATCAAGTTTT TGCGCTAAAG ATTGAACCTT AGTCTCATCA CCTACACGA TTGGTGTAAC 774 0 

ATAATCTGTT GCTTGTAATT GTGTTGCAGC TGTTAGAACA CGTTCGTCCT CTCCTTCAGG 7800 

TAATACGATT TTAACGTTTT TACCAGAAAG TTTGTCTTTT AATACATTTA ATAAATCAGC 7 8 60 

CATAATGTCC TCCTGTAATA TAAATCTTAT TAATCATTCA CGGTATAATT ATACGCCATT 7 920 

TTATTTCATA ATGCCACGTC ACCAATTTGT AATCATAATT TTATCGTTAC CCCTTTGCTT 7980 

TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG 8040 

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 8100 

CATTACGTAT AGTTCCAAAG GACGAATGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160 

TAGAAAATAC AGCAACTGTA AGATCATCAA AATGTGGTGA TCAAGCTATT TATAATATAA 822 0 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 8280 

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT fiATATTCAT 834 0 

ATGTATCAGT CAT T GAAT T G AGCAATTATT TAGCTGGTAA AT C T GAT G AA GATCCTTATG 8 4 00 

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 8 4 60 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8 64 0 

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 87 00 

CACGATACGG TGAATTCGGT AGTTTCTTTG TACSACATAT TATTAACACA AACGAATTCG 87 60 

ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAAT TCCA 8820 

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 88 80 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC T TAT AT AT AT GTGCTAATE 8 94 0 

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 9000 

GAACTTTAAC T AC TAT C AC T ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 90 60 

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 9120 

AACATATTTT TGTCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 9180 

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 92 4 0 

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 9300 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAATTCAGAAGAA TACAAACAAG 9360 

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 9420

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 9480

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 9540

GAATATTTAG AATTACCTCT CGAGATACAA ACT TAATAAA AATTTCTCAA GCTAATAAAA 9600 

TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AG C AAAAAC G CAACTTGAAA 9660 

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 9720 

TTGCAATGAG TTTCTTATAT TTACAAGGTG GTAGATTGAT TGATGTTTTA ACTGCGATAT 97 8 0 

TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 984 0 

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 9900 

TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCT ATTGTTCCTG 9 9 60 

GTGTATTAAT AACAAAC GC A ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10020 

CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

AATTTTAGTA TAGGAGTATC AGACTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 10140 

AGCTTC 10146
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 60 

GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAG TATGTCTGGT 120

GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA 180 

TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAGCTTTAA GCTCTGTTGC 24 0 

ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 300 

CGAATGATTA AACCTGTGTA TGCCACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 360

CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 4 20 

AAATTTCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 4 80 

TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 54 0 

GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC ACCCTTTACT 600 

TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG 660 

CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 720

AAAATTCGGT GTATTGATTT TAAAATTTTT TAAATAnAA AAGGCAAGAC ATTTGTGCCT 780

ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 84 0 

GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 90 0 

ATCCCGATTC CCTTTAAAGT AACGTCTATC TTGTTTACTA TATACATTTT CAGGATTSA 960 

AAACGGTTTG AGTTTATATT TAAATTCAAA TAAGTT TAGA CCATCAATCG CAACCCTATC 102 0 

TTGGTAAACG TTCGTTGCTG ATATATCTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 108 0 

CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 114 0 

ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 1200 

TTTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTTTTTCA AAATTAATGT GACCATATGA 12 60 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 132 0 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTGCGATAACCTA ATACGCCCTC 1380 

ATAACCGGTT AAAGCAACAA CACCTTTTTC AC CAT T T AAA GAAAAATCTG GATGCTCTTT 14 40 

TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 1500 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 1560

ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATTGAGATGA TTAGTGGCTT 162 0 

TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTCGTATAC 168 0 

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 174 0 

TTCACTTACA GTAATCATCC JATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 1800 

CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 18 60 

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 1920 

TTTATTkTGA GCTTTCTCAT TTTGCTTGGA AC AG C TAT AT mACAATGAA CTGATAATAA 1980 

CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 2022
(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

ATAGAGTGAT AAAGGATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAATTTTAA 60 

TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CTTATTTATC AGTGCGCTAG 120

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 180

TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATGCTTTATT AGGTGGATTT GCAGCATTAA 240 

TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATGCA ATTCACGCTG 300

AAAATAGTCG ATGGTCAAGA GTTMAGTCA AAGTGACAAT AATCATTGCA TTATTAGCTA 3 60 

TGAGTATCAT GAG T C AAAAC TTAATTCCTG TACATATTGC ATTCATTCCA ATTGTCATCC 4 20 

CACCATTGTT AAGTCTGTTT AATGACTTAA AAATAGATAG ACGTTTAATC GGTTTGATTA 4 80 

TCGGTTTTGG TTTATGTTTC CCGTATGTGT TATTACCATA TGGATTCGGTCAAATTTTCC 54 0 

AGCAAAT TAT TCAAAGTGGC TTTGCAAAGG CAAATCACCC AATTGAGTTT AATATGATTT 600 

GGAAAGCAAT GCTTATTCCT TCAATGGGGT ATATTGTTGG CTTACTTATC GGTTTATATG 660 

TATATCGTAA ACCACGTGAA TATGAAACAC GTAAAATTTC AG AT AG T G AC AATGTTACAG 7 20 

AGTTAAAACC ATATATCTTA ATAGTAACAA TTGTAGCAAT ACTAGCTACA TTTTTAGTAC 780 

AAACATTTAC AGATTCAATG ATTTTTGGTG CAC TGGCAGG GGTACTCGTA TTCTTTATTT 840 

CACGTGCATA TAATTGGTAT G AAT TAG AT G CTAAGTTTGT TGAAGGTATT AAAATTATGG 900 

CTTATATTGG TGTAGTTATT TTAACAGCAA ffGGATTTGC TGGTGTAATG AAT GC T AC T G 9 60 

GTGATATAGA TGAATTAGTT AAAACTTTAA CAAGTATTAC TGGTGATAAT AAATTATTTA 102 0 

GCATTATCAT GATGTATGTG ATAGGTTTAA TTGTCACTTT AGGTATTGGA TCATCATTTG 1080 

CAACAATTCC TATTATCGCA TCATTATTCA TTCCTTTTGG AGCGTCAATT GG ACT AST A 114 0 

CAATGGCATT AATCGCATTG ATTGGAACAG CGAGTGCATT AGGTGACTCA GGTTCGCCTG 12 00 

CAAGTGATTC AACATTAGGA CCAACTGCGG GATTAAATGT TGATGGCCAm CATGATCATA 12 60 

TACGTGATAC ATGTGTACCA AACTTCTTGT TT TAT AAT AT TCCTTTAAAT GATTTTCGGT 1320 

ACTATTGCTG CTATGGTACT 1340
(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

CAAAATCTAA GAGAATAAnA TTTGTTAATT TnAAATAGCA AGCAATTCAA AGTTATATGT 60 

GTAATAGATA AAATAGATAT CCCTATAGTG ATGCGTTACT AGCTAAACAT AATAACACAT 120 

TAGAAGATAA TGAAGTTAAG GAGTTACTGG ATTG'TTCGA CTATGTAATT AAGTATAAAA 18 0 

ATATCCAACG ACAAAACGTA ATTATAAAAT GGTAAAAGCT ATGGTACAGT TTCAAATTGC 240 



TAATGACATG CGTATCGGTG AGCTACTTGC AATAAAGAGA GTAAATATAA ACTATGAAGA 300 

TAAAACGCTA GATATCGACG GTAAAGTTAA TTGGATAACT GAAAAAAGAC GGGAGCATTC 3 60 

GGAGTAAAGG AG AC AAC T G A AAGAAGTAAT AGCTATAAGG CCACAGGGCT CACTACCCAA 4 20 

AGCATCGACT TACTAAGAAC ACTTATGCTT GGAATGATAA GTTTATTGAT AGAGAGTACA 4 80 

T AT TCACAAA TACGTCTGGT AGCCCTATCG AC T C G AAC AA AATTAGCCAC ATTATTAAAG 54 0 

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 60 0 

CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 660 

CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 720 

GATATGATGA ATAAATTTGA AC GATTGGGG AGTTAAAATT GAAAAAGAT GaTACACTAG 780 

CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 8 40 

TATTTTTTAA TAT TAT TG AC AC C G AAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 900 

TCTTCAAACT wGcTTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960

AATTTGCTTA TTTATCAGAT TTAGCAACTA TAGATATGaA AT T AAGAT AC ACAATGTTGC 1020 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 1080 

ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAAT CAT AT A 1140 

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA AC C AAAAT C A 12 00 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 12 60 

ATAAATACTA T CmAAAT C C A CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 132 0 

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATCB AGAGTTGCTA 1380 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 14 4 0 

TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 1500 

TAGGCATATC T AGAAAC AT G TTCGGAATAA AAAAT TTCAT AGAT ACT kTC ksTACGCTAG 1560 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAG A T AT C AC GATTTGGACC 1620 

AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 1680 

GATTTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 17 4 0 

AG T G AGG AAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 

GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860 

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAAC AT T C A CATCATCCTT 1920 

TTTtGCCCTT TTTCTATACC C C AAAAC AC A AAAAGCCCCG TAAGCCTATG CcTACGGgT 198 0 

TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 2040

AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACACG 2100

ACGTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160 

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 222 0 

TCTTCTTGGT CCACCTGCCA TGATTGTGTG CCTCCTTTGA TAATTTTTTC GCTTTAATTT 2280 

CGTTAATTTT TATTAGAATG GTAAGTCATC AT C AC T TATA TCAATCGGTC CGTTTGCATT 234 0 

TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATA TTGTACGAAT TGTTTTGTCC 24 00 

TGATTGTTGA CCACCGAATC CTTGACCGTA ATCTTGGAAT TCATTTTGTT GACGTTGGCC 2 4 60 

ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAACG C TAT C AC AC A CAACTTCAGT 2 52 0 

AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 2580 

ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 2 64 0 

AACACAGTTA ATAAAATctG ctTCGCGCTC CCCTTGAGCA TTCGTGAACG T AC GAT T T AC 27 00 

TGCAAGAGTG AATGtCGCTA CACTCACACC TGAGGGAGTG GTTCTGTATT CCGGATCTTT 2 7 60 

CGTTAAACGA CCTACTAATA CAACTCTATT TAGCATTTAA ACGCCCCCTC TAATTATTAC 2 820 

TTGTCTTCGT CTTCACGAAT AACCATGTAA CGAATGATAT CGTCACTGAT TTTAGCTAGA 28 80 

CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 2 94 0 

CCATCTTTGA AAT CAT T GAT TTCATAAGCT AGGCGACGTT TACCCAGTC TTTTGCTTCT 3000 

AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 3060 

GCATCTTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCT CATTTTATAT 312 0 

TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGG aATAAT 318 0 

TTTCATTACT CACAATAAAG AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 324 0 

AACTACTCTT CATATCATTT TTGATATt AA TTCATTTGaA ACTTTCnATG ATATTTTnAA 3300 

AAAT AC AC T T CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 3360 

ATTTA 3365 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

GTTAAAAGTA ATTGGTGGTA T T GAT GAT G A TTTTACAGCC AATGTTATGC ATCCAAATCA 60 

ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 120

TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATmG AGCAGGCAGT 180

ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 24 0 

TGCTATGGAT CAACATGATG CTTGGGACTA TAACGCTGaA ATTAAAACAA TCCTCTCAAA 300 

ACTAGGTATA CATGATACTA CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG 3 60 

TGTTGTACTT GCTAAAACAT TAATAGAACA ACCAGATTTA TTGTTATTAG AT GAAC C T AC 4 20 

G AAC CAT T T A GACTTCGAAT CAATCAGCTG GTTGATCAAT TATGTGAAGC AATATCCTCA 4 80 

TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 54 0 

TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTGAAAT 600 

GCGCGCTGAA AGAGAAGTAA CACTTCAAAA G C AA C AAC AA AAGCAACGAG CTTTATATAA 660 

GGAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG T AC T AC AAAG cAACAAGCTA 720 

GAATTAATCG ATTTAATGAC CTAGAmAATG AAGTTaACCA GCAATATAAA GACGAAAAG 780 

GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAGT GTTCGAATTA GAAGACTTAT 84 0 

CAAAGGCTAT T AAT GAT AAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG 900 

AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 960 

GTGGAGAAGA CC AAC AAT T C GAAGGTAAAT TGAAGACTGG GCAGACGGTT AAAGTAGCTT 102 0 

ATTTTAAGCA AA 1032 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

TGTGATTAAC GAAGCTTATT TTCGTACACC T T C AAC AAC T GAT T AC AACG GCGTTTATCA 60 

AGGTTATTAT ATTGATTTTG AAGCAAAGGA AACTAAAAAC AAGACGTCCT TTCCTTTAAA 120 

TAATATTCAT G AC CAT C AAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 180 

GTTTTTAATG ATTCGTTTTA AAACGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 24 0 

CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAA 300 

ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAACCA AGATTAGACT ATCTAAAAGC 3 60 

AGTTGATAAG TTGATATTAG AT G AAAG T GA GGACCGCGTA TGACGGAAAA CAAAGGATCT 4 20 

TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 480

GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTGCATTTTT CGTTGTTCTT 540

TTACTAGGTA TCTTATTGTT TGCTTATTAT GCTTGGAAAG CACCTGCTTT TACCGAAGCT 600 

AAATTACAAG ATCCGATTCC TGCAAAGATA TATGACAAGA ACGGAGAACT TGTTAAAACA 660 

TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA ATCAATGAAA 72 0 

GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 780 

CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCtGA AGGtGCCTCA 840 

ACATTAACAC AA 852
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

CACTTTTTTC ATTAAAAATC TCATATTTAT AC AC T G AAC G TAATCTCGAA TATTTTTCAA 60 

CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 120 

GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 18 0 

TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 240 

CACCCATCTT ATCAGTTTCA GCATAACGCG CTTCTATTTC TGTAATACTA TATATCATTT 300 

TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 360 

GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AASTTTATA GAAGTATTTT 420 

TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 4 80 

TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA CCGTGACGAT AGTAATCCAT 54 0 

TTCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 600 

CTTAGCAGTA ACTTTGACGT AGtCATGTGG TTGAACATTT TCATCAATAT T AAC AG r AAT 660 

TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 720 

TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 7 80 

TGTTTTAACA CCTAATAAGT TTGT7CCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 84 0 

ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 900 

AT C AAAGAT A GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 960 

TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCATTACTTCGTG 1020 

ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 1080

ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 1140

C AC AGAAT C A CCGAATTTAC CCATAACACG TAAACCATTT AATGGAACAA TCGTACCCGG 1200 

TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT T AGGAT C AAA 12 60 

G T CAT AT AG A GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTGT TATTGTATAC 1320 

GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTAT CAACGG TATCTGAAAC 138 0 

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATVTACATCT TCACCGTCAT TACCTTTACC 14 4 0 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 

CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG AT AC G AC C T T CAAAGTTACG 15 60 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTC 162 0 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 1680 

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 17 4 0 

TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 1800 

TTTTTTAGCA ACTAAACCTG cACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 18 60 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 1920 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCACTTAAGAAAA TTAAATCTTG 2040 

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 2160 

TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 222 0 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTGCAATTGT 22 8 0 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 234 0 

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 2 4 00 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 24 60 

AGGTTGTCCA AGCATTCCAG CTTCAGCTTC AATACCACCA ACACCCCATC CTAGTACGCC 2520 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 2580 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATCETA AGTTAACTTG 2 64 0 

GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 27 00 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 27 60 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2 8 20 

TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 2880

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 2940

AAATGGTACC TCGCCTTCAT TTCCATCTTT TCCAAACTGA CTTAAAGCTT TAATATGATC 3000 

GTCTGTAATT ACAAAATCAT CTTCTTGAGS AAGTAAAGAT TCTAACAAAA CACGAATTGA 3060 

ATAAGGTAAA TTGGAAACTT TAG T AAT AC C TTGCTCTTCT ACAGCTTTTA AAT CAT AG T A 3120 

AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA AT TEAT TAT 324 0 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3300 

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3360 

TAATTGAGAA T AAT TAT C AA TTTCGCGAAT GAT T C AAT T C AATTTTTAAA CGTATTATTT 34 20 

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 34 80 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3540 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTCIGTT GGGGCCCTCA CCCCAACTCG 3660 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAG AAT T G AA 37 2 0 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780 

CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 384 0 

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT C GAG AAT T C G TAGTTTAATA ACGAAATTCT 3 9 60 

CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CAAACAAGTA GTATATCTGT 4020 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4 080 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AG AAT G T C AG ATTCATAATC 414 0 

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200 

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACSTCATGA GTATGACGAC 4 2 60 

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4 320 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 4 380 

AACAACTTTT ACCATACTAG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 4 440 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4 500 

C GAT AAAAC A ATGCCAAATG GCACATGATT AAACACTCCA AATAGCTGTG TTTCAGGAGT 4 560 

CAT AT C AAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG C C T AAT AC AC CAAAGACACT 4 620 

AAACCAAACA AAACTAACAA TTGCT3GAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4 680 

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4 74 0 
CCAACCCCAG TAATAAAGTG TCCATGAAGA CATCCATTCA CGTTTTTGAG GATTTAAAGC 4800

TGCTGTATCA AAACTATTAA ACAAGAATGT GTTTAGTAAA CTACCCGTAGAGCTAGTTAA 4 8 60 

CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA GCTACCATTA AAATAGTACC 4 920 

TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA CTTAATCCAG ACCATGCACT 4 98 0 

TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT TGTACAAACG TATTGTTTGG 504 0 

AACATTGAAC AAGTAATGTA AAC C AC C AT T AATTTGTAGA GCACCCATAC CTAACGAAAC 5100 

GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA ATAAAAATCC CAATAGGACC 5160 

TTCTACTTTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT AATAAACCTG GTTCACCTTT 522 0 

ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA AGCAT GGAAT 52 8 0 

CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG CTTCAGTAGT TTTGGGATCT 53 4 0 

GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG CACCATAAAA CACCAAACCT 54 00 

ATCCCCATAC CAGC AC T AAA CAACATAGCA AAC CAT G AAA TTGTATTAAA CTCAGGTIG 5460 

TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA AAATAAGGAA TATACAGAAG 5520 

AACACGATAA TCGTAGTAAG AATAAGATAA TACCAACCTA ACTTTTCTGT AATCCACATT 55 80 

TTAATATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA ATGCACCAAG TAATACGACT 5 64 0 

ATAGCAACAA CAATTGCACT AT AG AT G AAG ACTGGTGAAT ACTTCTTTCC ATTTGGATTC 57 00 

TCTGGTGAAG AAG AAT T CAT AATTAATTAC TCCCTTCAAT TCTATATTTA ATTTTATGTA 57 60 

GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT CACG 5804

(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

ACCCGCGAAT ATGGTCCATC CTATCGATTT ATTTTHACT GGTTTGACAA TATTTAATTT 60 

TTCATAATCA TTCTTAGTGA TTTTGACATA TGTTTTCGGT ATGAGCCAGT TAATAAATGG 120 

AAAGAAGAAG ACAATCCAAT TACTTGCCAA ATCaATCATT AAATATTCAC TATCGTATTT 180 

GATTATTCGA TATTTAGGGT TTTTATTAAT AACTTTAGAT TCGCAAAGCA ATGTCTCCAC 24 0 

ATCCCTTTAA TTTTATGTGT AATACATTTT TCGATACTTC AAAAGACATT CAAATACTAT 300 

CAAGTTACTG TCATCAAAGG TTTTATTAAC TGATATTtTC ATATTTTTAa TCTGAATTTA 3 60 
GTACTTTCAC TTAATTTTTA AGTATGACAT GrATGGTTTT 400



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAGAT GCTGATATTA ATTCTATGAA 60 

AACAGCAGTT TTACAACAAT TAAAAGGAGA AT AAC AT AT G CTTACTGGcA AACAAAAAAG 12 0 

ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 180 

CAACGAAAAT ATGATTAAAC AAAT AG AT GA T AC G T T AG AA AAC AG AG AAT TGATTAAAGT 24 0 

ACATGTACTA CAAAATAACT TTGATGATAA AAAAG AAT T A GCTGAAACAT TAAGCGAAGC 300 

TACTCATAGT GAATTAGTGC AAGTGATTGG AT C TAT GAT A GTGATTTATA GAGAATCTAA 360 

AGATAATAAA GAAATTGAAT TGCCATAATA AT GAAAAAG A TAryACTTTA CGGCGGTCAG 42 0 

TTTAACCCTA TCCATACTGC ACATATGATA GTAGCTAGCG AAGTATTTCA TGAATTACAG 4 80 

CCAGATGAAT TTTATTTTTT ACCTAGTTTT ATGTCTCCAT T GAAAAAGC A CCATGATTTT 54 0 

ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 60 0 

GATATTTGTG AC GAT G AAAT TAAACGTGGT GGTCAAAGTT ATACCTATGA CACGATCAAG 660 

GCATTCAAGG AGCAACACAA AGAC AG T GAG TTGTACTTTG TTATTGGGAC GGATCAGTAT 720 

AACCAACTAG AG AAAT G G T A T C AAAT T G AA T AC T T AAAAG AAATGGTTAC TTTTGTAGTT 780 

GTAAATCGAG ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTGEAT TCAGATACCT 84 0 

AGGGTAGATA TAAGTTCGAC AAT GAT T C G A CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900 

GTTCTTGTTC CTAAATCCGT TGAAAACTAT ATTAAGGGGG AAGGAT TATA TGAACATTGA 9 60 

AAAA 964 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
CCCATTTTTT AATTTATTTG GTTGAATGAn TGGGGTTATG AAAAAGGAAG TTTTAATCTT 60 
TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT 120

ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATGGCTG ATGATGAATT 180 

TAAAGTATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 24 0 

GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 300 

TTGTTTAGAG CCTTTCATTA TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 3 60 

AATAGTTATT TTTGTAATTC TAAACCTTTT TCATCGCGAA AACAATTAAATAGGTCGCGG 420 

TATTAATTAT TAT ATT ATT A CCGCTTAATA T G AAAAAT AC ATGAaAATTA ATTTTCTAAT 4 80 

ATACTTTTGA AAAATTATTA CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 54 0 

GTTTCAAAAG GGCTGATAAG C AT AAAAT G A AATGTAATAT TTCGATGTAT AAAATTTTTA 600 

AT T AGCTAAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC GGACATTTTA 660 

ACGACACATA TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 720 

CGTCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC GTTGTATCTT 7 80 

TATCCAAATG ATAAACCTCG CTAGTTAAAT ^TCTTCAAT TGCTTTCACT TTGAGTCCTT 8 40 

GAATCTCTTG CGGTGGATTT GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 900 

CTTCAAATCC TTCTAGCGTA TGCGgAATAA TGTATCCTCA TGTCTACCAA C AG T T T GAT A 9 60 

AATCTGTTCT AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATT 102 0 

TATAATGAGT GGCACAATTT GTACGGCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 1080 

ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG TCTTATTTCC 114 0 

TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT ATA 1193 



(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60 

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120 

nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180 

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACTACTT TGTTAATTAA TGGGGAACAT 240 

GATCCTTTGA TTAAAGATAA GAATCATTTT AAATTGGAAG CGCATTTTTT AAATGTTACG 300 
AAAAAAATCT TCGAACATTC AGGACATGCA CCGCATATTG AAGAACCAGA AGCATTTATG 360

AATTATTATT TAAAATTTTT AAAAAGCGTA TCATAATATG TGATATATAA ACCTAGGGCA 4 20 

TAAAGTCCTT AGGCAATGTG AAAAAGCTGA TTACTATTCA TTATTTGATA GAAATCAGCT 4 80 

TTTTTTGAAA TGTATTTGAT ATATACTGCT CGTTATGCGG CTATCTTCCT TATATTAAGT 54 0 

GCCATTAGTG CAAAACCTCT TAACAATTAG GTAAAAAGAG CATAAAAAAA GGAAGTTTAA 600 

TAGAATGTAT CATCTATCAA AC T T C AC C AA ATTGCGCTAA ACAAAATTAT AGTTCAATTT 6 60 

CGTTGTTTGC TTCAGTGATT CGTTTATTTA CTCGACTCAA TAATGATTCG ATTTTTTTAC 720 

GTTGTTGTGC ATTAACAAGA AT T AAT AC AG TTCTTTCATC ATGCTCATTA CGTTTTTTAT 780 

CGAAGTAATC TTCTTGAGAT AAAATTTTAA CTGCTTTAAC ACTTGTGGT TGTTTGTAGT 840 

TTAAATGATT AATAATATCt TTAAGATAGT ATTCTTTCTC TTTGTTTTCG CTGATGTATG 900 

TCAATACAGC GAATTCTTCA AAGCTAATTG AnAATTCCTT T T T AAT T AAA CTTTTTAATT 960 

TGTCAGCATA AGTGACCATT GATAACAACT CAAAGCAATC ATTGATTTTT GTAATTGCCA 1020

TGTTTAAAAC CTCCCTATTT GATGCATCTT GCTCGATACA TTTGCCCCGA TAATATAnTG 1080 

TATCTAATCT TTATGnAT 1098



(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

CCAGGTAAAA TTGTGCAATC ATTTGACGCA TTAATGGACG CCTTGGACAA TGAAGATTAT 60 

GAAGGAGAAA AAGTCATTCC ATTCTTAGAT AAACATTTTA AAT AT C AAG A TGGCCGATCA 12 0 

AGTGAGCGTT TAGTCAGAAA TTTATTTGGT AGCTAAGTTT AT AT AG TAG T CAAAGTGGGA 180 

GAGGTATAAT GATGAAATTT T C AG T AAT AG TTCCAACATr C AAT T C AG AA AAGTATATAA 24 0 

CAGAATTACT TAATAGCCTT GCGAAACAAG ATTTTCCGAA AACCAATTT GAAGTGGTTG 300 

TAGTTGATGA CTGTTCAACA GATCAAACGT T AC AAAT AG T TGAAAAGTAT CGCAATAAAT 3 60 

TGAACTTGAA AGTAAGTCAA CTCGAAACAA ATTCTGGTGG TCCAGGTAAA C C TAG AAAT G 4 20 

kGGCGTTAAA ACAAGCAGAA GGTGAATTTG TATTATTTGT GGACTCCGAT GACTATATAA 4 80 

ACAAAGAGAC TTTAAAGGAT GCAGCAGCAT TTATTGATGA ACATCACTCA GATGTCTTAT 54 0 

TGATTAAAAT GAAAGGTGTT AATGGTCGTG GTGTACCACA ATCTATGTTT AAAGAAACAG 600 

CACCTGAAGT TACTTTGTTA AAT T C AAG AA T TAT C TAT AC TTTAAGCCCG ACTAAAATCT 660 
ATAGAACAGC ATTACTAAAA GATAAEACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 720

AAGATCAATT ATTTACAATG AAAGC AT AT T TAAATGCAAA TCGAATCAGT GTGTTAAGTG 780 

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTGAACA TAT GAG TAG T GCGTATGTTT 84 0 

CACCTGAAGA CTTTTATGAA GTCATGAGAT " TGATTGCTGT AGAAATATTA AT GC AG ATT 900 

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 960 

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 1020 

CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 1080 

TACGACCATT G T T GC AC T AC G C GAG AG C G A AAGATATAGA CAACTATAGA ACTGTGGAAG 114 0 

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATGGTAAA TTAAACATTC 1200 

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 12 60 

TGACAGCATT TAAATTTGAT AATCATAAAA TTffTACAGA GCTAACGTTA AATGAATTTA 1320 

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 1380 

CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 14 4 0 

AAGATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCA& 1500 

TAGGTACGGA AG TAT T T G AA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 15 60 

CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 162 0 

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 168 0 

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 17 4 0 

TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAG AC G AC AT GGTGATGGGA ATGTTAAGCC 18 00 

AAACAT CAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG AT AT T AAAAA GCGAGACTTC 18 60 

AAAAAT AT TG TTAAGTTAAA CACTGCACAT ATCACTTATC C ACTAAATAA ATAATAAATG 1920 

CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT T ATGACAT GA 198 0 

T TAT G AC ATT TCCCTGATTT TCATTTTCAT AT AC AT T AAA TTGTATACAC TGGAAATGAG 204 0 

GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 2100

GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG G AC AAG T AAG 2160 

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCGTCT 2220 

CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 228 0 

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT C T AAG AG AAG AATTTAAAAA 234 0 

TTTAACACCT TATGTGACGA AGTTATCGAA AGTACATGGA CCTAATCATC CATATTTAGT 24 00 

TGAGTTAAAA G AAACAT ACG ATACATTTAA AAATGGCATG TTAGAGCATA T GC AAAAAG A 2 4 60 
AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGffiG TAGACGATAT 2520

TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 2580

AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 2640

TGTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 2700

TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 2760

GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 2820

AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 2880

A 2881



(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTGTATTAA GATGCTCAAC TAATTTGGGT 60 

ACATATTCCG AAT T TAG AT T TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT 12 0 

AAAATAAATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA ACAC T AAATGTT C ATCAT T A 180 

TTGGTAAACC AGCCGGAAGC ATATGCATTG GCATTAGGTT C AC C AAT T G T T GAAG AT AAA 24 0 

TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 300 

ATCCAATGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 3 60 

TTAAATTCAG G T T TAG AG AC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 420 

TGTTTTGATT TGTAATTGGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 4 80 

CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 54 0 

AAT AAAT CAT AGTTCATATT TGAGTATTCA AiTTCTTCTC CGGGCTTATG ATGTAATTCA 600 

TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT C AC AG AAT C T 660 

tCGCTTGTAA TAtCACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCGTAATA 720 

GTTTCATTTT GACCATTATA GTTCATTTTA AAATGAGGCA CATGTTTGGA TACGGCACEA 780 

TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 84 0 

GTATTAGAAG CAATTTCATA CTTTGTTGTG GGAGAAGCTT TGACTTTTTT ATCAACATCT 900 

TGATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 960 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 102 0 
TCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 1056



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60 

AT TATTACAA GTATTTTATT AAATGCTAAG CCGCA"GAAG TTAAACTTAT GTTAATCGAT 120 

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 240 

TATGATTTAT TCCAACATTC AT C AAC TAG A AACATTAAAG GTTATAACGA ATTAATCCGT 300 

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 3 60 

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC G AT T C AAC G T 4 20 

AT T ACACAAA TGGCACGTGC AGCAGGTATA CATTTAATTG TAGCGACACA AAGACCTTCT 4 80 

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 54 0 

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 600 

AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 6 60 

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TSTAGAACA ACAACAGGCA 72 0 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 7 80 

GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AAC AAAAGG C aAGTACATCA 8 40 

T T G T T AC AAC GCCAATTTaG AATTGGtTAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900 

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 9 60 

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 1080 

TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGcA 114 0 

TTGATTACTG AACAAGTCAT T AAAGAT A t T TTGAAGAGGG CACAAGTGTT AAGTCACTGC 12 00 

CCCCTTTCTT TTATCCATTG AATGAACTTA TAAGTATTGG GCAAATGATT AAAAATGCAG 12 60 

GrtTTGAATG CGGAACT 1277 
(2) INFORMATION FOR SEQ ID NO: 257: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 60 

T T AACAAGT A TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 24 0 

T T GAT AA.T AT GGAAATCGTA ATATAAAACn AAAACTTAAT TT ACT AT AT A AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 3 60 

TTTGCAATAA CAGCAACATC TGGTGCSGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 4 20 

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 4 80 

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 54 0 

CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCACAAGCT 600 

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 6 60 

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 7 20 

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 7 80 

ACAGGTAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 840 

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900 

TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 9 60 

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTtSTCCAT ATGGTCATGT TGCTTATGTT 1020 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080 

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 114 0 

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 1200 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 12 60 

ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 1320 

TAAATTAAAT GCATAATATA TTAAAAATCA AAATAATGCA ACTAAGTTGA CTATATAATG 1380 

ATTTAATTTG TTATACCAGT ATTTTACGCT TTTTCGTCTA CATATACAAA TTTATATTAA 14 4 0 

ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 1500 

ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 15 60 
CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC AATTAArAT GCyTTGTCTT 1620

GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 1680 

ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 17 4 0 

AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 1800

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAGCA 18 60 

CCAACACCTA AGATTGATGA T G AC AC AAC G TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920 

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 198 0 

ATTTTCATAA TATTACCACC TAO\GTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2 04 0 

AGCCCCATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATCA 2100 

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 2160 

TTCGTACCGT GAG AG AAT G A TTGTAACGCT GCTGTGAAAA TTTGGAAAA TCTAAAGTTA 2220 

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTGA AT AC AT C AAG 22 8 0 

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 234 0 

CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 24 00 

CCTATAAGTG C AT G T G AAG A CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 2 4 60 

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520 

TCGACAATGT C T T TAG T AAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 2580 

TTCATCACTG CTGCCATTAA AATTGCCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 2 64 0 

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 2700 

AT AAC T AC AG C TAT AG T G AC GATGATTATA TATGACATAA AT AT AT AC T C CCCTTAGCTA 2 7 60 

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTCT 2 82 0 

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGATCTGT TTCGCTATTG 28 80 

AAAATATGTT T AAT T G AC T G TCTTAAAATA C CAT C AC AG T TTGTTTCAAA TTCTTTAATA 2940 

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3000 

TTCATTTCTG GAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 30 60 

TATTCGATTG AAT AC AT T T C AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 3120 

TCAATTGCAT CACATAATGA TAAAATATCT TCACGTTCAA TTGGTGTGAT AAATGTTTGA 318 0 

TTTAAATCAG TAATTACTTG ATGTACTAAt tCGcACCAG TGACTCATAA GTTTTAATGT 32 4 0 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3300 

CAGCACGATC C AG AT T G AAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 33 60 
TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 3420

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 3480

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 3540

CTGTATTTGC CCTTATA 3557



(2) INFORMATION FOR SEQ ID NO: 258:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60 

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 120 

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 18 0 

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACCCATAATGCCA ATTGACCCTG 24 0 

C AC C AAAT AC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300 

GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 3 60 

TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420

TCTTGCTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 4 80 

TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 54 0 

TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 60 0 

ACTACGATAT CATCCTTCGT TCATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660 

TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720 

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 7 80 

AC AG G A C AAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTT GTATTGTTTT 84 0 

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900 

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960 

TTTTTTGTAA CTTTTTATTC ATTTTAGCGA TTTCTTTGCG ATTTTTTTCA GCTAGTTTAT 1020 

CGCCTTTTTT CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 1140 

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200 

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 12 60 
CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTKACAA 1440

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 1620

AATTAnAAAA A 1631



(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

C G AAAT CAT G ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 60 

ATTCTTCAAC TGTTGCTAGA TATTGTGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 120 

CaTTGTGkTC niAAATATTTA AATTTTCCaG G&ATCTTAT AAGTCTTTCC aTATCTGATA 180 

ATCTtTTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 24 0 

TTATACTTTC TTTTTACATA TCAATCAATT AAATATCATT TaAATATCTT CTTTaTATAA 300 

CTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGET 360 

AGTCTTGTCT TGATATATTA CTTAGTGGAT GT AAT TAT AT TTTCCTGGAT TTAAAATTGT 4 20 

TCTTGAAGAT TTAACATTAA AT CCAGC ATA GTTCATTTCA GAAACAGTAA TTGTTCCATT 480 

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAGCAGCTG TTTGGAAAAT 54 0 

AGCGCCAACT TCTG3TGTTT TATCTACTTT AAATCCTGCA ACTTTTGCTG CGTAATTCCA 600 

GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 6 60 

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTTAGCAA CTGATTTATT 720 

GTAGTTATGT GCAaCAGGtA CAGTTGGTAC TGATTTTTGT ACTTGAGCAG GTTTGTATGC 7 80 

TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 8 40 

TGCTTGTTGT ACAGGTTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900 

TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 9 60 

AAAAATTAAA GGTGTTACAC CAGGATTGTA TGAATATAAT TGATTCAACG TTAAGTGATG 1020 
CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTGATTG 1080

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 114 0 

ATGTTGTTGT GCATGTGCTG GYTTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200 

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 12 60 

AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AAT CAAATGG 132 0 

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTffiTC ATACCATGTA 1380 

ATTGAGAACG TTTTCAATAA TTAATTCAAT ACCTTGAAAA TCGCCATAGG TAATATTACT 14 4 0 

AAATGCACAC TGCATATGTT GTTTTAACAA AC AC AAC T T T TAAAAAATAT ATTCTAACTC 1500 

TATCTACCGA ATTGTACTTA AATAT TCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 15 60 

ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 162 0 

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 168 0 

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 174 0 

CATTGAGTAT GGATGGAAAA CAT AT GAT CT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 1800 

TTCGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 1860 

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920 

ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCGTATT GCCGATTCAG TTTOTTAAT 1980 

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 204 0 

TTCAATCTCT AT T T CAT CAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 2100 

ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCGTACT AGTCGGTGTA CACCTTTTTC 2160 

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 2220 

CGCTTCATCC CCAGGTAGAT AAT C AAC AG T TTCAACTTTA AAGCCTTTCT T C T C AC AAT A 22 8 0 

ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 234 0 

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGT&GGC CCATCTAATA ATAATTGCAA 2 4 00 

TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCTCTAAGT CTTCTTTCAT 24 60 

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2520 

TTGTAGTGTT T T AT AACCAT TAACTATTGC TTTTAACGCA TTATTTTTAT CTATAATATC 2580 

TTGCGCTTTC GTTTGGTTAT CCCAAAAATT AGGTTCTGCC ATCATTTCTT CATATTCTTG 2 64 0 

AATATTAGTT TCTTTGTTCT CTAAGTCAAA GAGACCCCCT AATTTGTGTT AAATCTTGAT 27 00 

TATACTTATC TATATTTCGT TTGATTTCTG ATAATTCCAT AGcATTCGCT CCT ATT TATA 27 60 

TTTCAATTCA AGTCATTGAT TTGCATCTTT TATAATGCTA AAT T T T AAC A TAATTTTGTT 2820 

AAATAACAAT GTTAAGAAAT ATAAGCACAC TGACAAT TAG TTTATGCATT TATTGTTAAA 2880 
AATCAGTACA TTTATCATCG ACATATGCCT AAACCGATTT TTTAAAACTA AGTACATAAC 2940

AACGTTTAAC AACTTCTTCA CATTTTTTAA AG T AT T T AAC GCTGTAAAA T AAAAAG AC T 3000 

CCTCCCATAA CACAAACTAT AGGTGTTTAA TTGGAAGGAG TTATTTTATA TCATTTATTT 3060 

TCCATGGCAA TTTTTGAATT TTTTACCACT ACCACATGGA CAATCATCGT TACGACCAAC 3120 

TTGATCGCCT TTAACGATTG GTTTCGGTTT CACTTTTTCT TTACCATCTT CAGCTGAAAC 3180 

GTGcTTCGCT TCACCAAACT CTGTTGTTTT TTCACGTTCA ATATTATCTT CAACTTGTAC 32 4 0 

TACAGATTTT AAAATGAATT TACAAGTATC TTCTTCAATA TTTTGCATCA TGATATCAAA 3300 

TAATTCATGA CCTTCATTTT GATAGTCACG TAATGGATTT TGTTGTGCAT AAGAACGTAA 3360 

GTGAATACCT T G AC G T AAT T GATCGYTTGT GTCGATATGA TCAGTCCAAT GGCTATCAAT 34 20 

AGAACGAaGT AAAATCATAC GCTCAAACTC ATTCATTTGT TCTTCTAAGA TATCTTTTTG 3480 

ACTTTGATAT GCTGCTTCAA TCTTAGCCCA AACGACTTCG AAAATATCTT CAGCATCTTT 354 0 

ACCTTTGATA TCATCCTCTG TAATGTCACC TTCTTGTAAG AAGAT G T C ATT AAT G TAG T C 3600 

GATGAATGGT TGATATTCAG GCTCGTCATC TGCTGTATTA AT AT AG T AAT TGATACTACG 3660 

TTGTAACGTT GAACGTAGCA TTGCATCTAC AACTTGAGAG CTGTCTTCTT CAT C AAT AAT 3720 

ACTATTTCTT TCGTTATAGA TAATTTCACG TTGTTTACGT AATACTTCAT CGTATTCTAA 37 8 0 

GATACGTTTA CGCGCGTCGA AGTTATTACC TTCTACACGT TTTTGTGCTG ATTCTACAGC 38 4 0 

TCTTGATACC ATTTTTGATT CAATTGGTGT AGAGTCATCT AAACCTAGTC GGCTCATCAT 3900 

TTTCTGTAAA CGTTCAGAAC CAAAACGAAT CATTAATTCA TCTTGTAATG ATAAATAGAA 3960 

GCGACTATCC CCTTTATCAC CTTGACGTCC AGvACGACCA CGTAACTGGT CATCAATACG 4 020 

ACGAGATTCA TGTCGCTCTG TACCTATTAC TGCTAAACCG CCTAATTCCT CTACGCCTTC 4080 

ACCTAATTTG ATATCTGTAC CACGACCAGC CATGTTAGTG GCAATAGTAA CGGCACCTTT 414 0 

TTGTCCAGCG CCTGCAACAA TTTCAGCTTC ACGTTCATGA TTTTTCGCAT TTAACACEC 4 200 

ATGACGGATA CCACGTTTTT TAAGTAAATT TGAAATATAT TCAGAAGTCT CAACTGCAAC 4 2 60 

AGTACCTAAT AGCACTGGTT GCCCTGCCTT GTGTTTTTCA ACAACATCTT CTACTACTGC 4 320 

ATCAAATTTA CCTTTTTGGC TAATGTAAAT T AAAT C AG AC TTATCGTTAC GTTGCACAGG 4 38 0 

TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4 4 40 

AGTTTTAGCT GTACCTGTCA TACCCGCAAG TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4 500 

TGTAATAGAC GCCATAGTTT TAGATTCATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4 5 60 

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTACGTCCTG T AAAT T GAT C 4 620 

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA ATGTAACGTG 4 68 0 
CGCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4740

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGTTAAAT GTACAGCTTT 4800

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4 8 60 

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACGTGC 4 920 

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4 980 

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 504 0 

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCG TCTTACTGTT 5100 

TAAGTTTAAT CCGACAGTCA AACCTAAGAA GTTATATAAC TCAGCCATTT CTTCACTTTG 5160 

AACACTTGAT AAGTATTCAT TGACTGTAAT AACGTGAACA CCTCTACAG CTAATGCATT 5220 

TAAGTATGTT GGCATTGTCG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCGAT 52 8 0 

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 534 0 

TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT GGTAAAATTT TATCTAAATA 54 00 

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTCGTTTT 54 60 

ATTACGAATT TCTTCATCAG TTAAAATTGC CGTTTTTTCT TCTAAAGCGA TTACTTTATC 5520 

AGCAAGTTTA CCTAACTGTT TAATTTCTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580 

TCCCATTTCG TTCGCTCCTT TAGCTAAAffi. ACTGTTTGGC CTACAACAAT ATATCTTATC 5 64 0 

ATTTATAGTT AGAAAATTAT AC T T AT T T AC TCATTTGTAG AATCAATATA AATATATTTA 5700 

TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATTCTCAA 57 60 

TACGCAATTA TACTTCCTAA TAAATTATAT TATAAATATT TTACGATTTT CGATCGGAC 5820 

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 5880 

TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATTAT CGAACTATGA AATAGTCATA 5 94 0 

ATTTGCTTTT GGAGTATAAA AAAGCACTTG TGCAAAAACA CAAGTGCTTT AAACTTAATT 6000 

TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCGGTA AAC GAT AC T T 60 60 

GTTCCATCAG TTTCTCTGTC TGTGAATACA AAG AAG T CAT GACCTAATAG ATTCATTTGT 612 0 

AATACCGCTT CTTCTGAATC CATTGGTTTT AAGCTGAATT CTTTTGAACG AATAATTTCT 6180 

ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGT5TTT CTTGCATTTC TTGTAATTCG 62 4 0 

GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GAT TAATACG TGTTTTATAT 6300 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6360 

TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 64 20 

GCTGAATTTG AATAAGTTTT AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 6480 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATC TGTGATAGTG 654 0 
AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 6645

(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7430 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 60 

AAACGGTGTT GACGAACAGC AAC AT AC T G A AAATTTAACT AAGAACTTGC ATAATGATAA 120 

AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTE AATAAAGATC AACTAAAGGA 180 

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240 

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 3 00 

TAAAGTGGCG TCACAGCAAC AG T C AAAAG A GGCAGATAAT AGTCAAGATT CAAACGCTAA 360 

TAACAATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 4 20 

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 4 80 

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 54 0 

TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AAC AAT C AAA AATCACCAGT 600 

AGAACCAACA AAGGACAATG ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 660 

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 720 

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATGTTT TAGTGCATGG 78 0 

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 84 0 

TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900 

AAGTGCTTTT GGAAGTAACT ATGACCGCGC AGTTGAACTT TATTATTATA TCAAAGGCGG 9 60 

TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT AT G G AAAAAC 1020 

ATACGAaGGA AT T T AC AAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 1080 

TATGGGCGGT CAAACGATAC G T C AAC TAG A AGAATTACTG CGTAATGGTA ATCGTGAAGA 114 0 

AATAGAGTAT CAAAAGAAAC ATGGTGGcSA AATTTCTCCA C T AT T C AAAG G T AAT CAT GA 1200 

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC ACGCATCAGA 12 60 

TTTAGCTGGT AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 1320

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAGCCAAA 1380

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 14 40 

TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCGTA AAACGTCGTT 1500 

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA AC GC AC AAAG CATTAAATAG 15 60 

C GAT AG AC AA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 1620 

CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 168 0 

TCAACATCCA TTTAATCAAG CTTATACAAA AGC GACAGAT AAAATTCAAA AAGGCATTTG 17 4 0 

GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATG'TGAC TTTGTAGGAC AAG AC AG T T C 18 00 

TGATACAGTG C G C AC AAG AG AAG AAT T AC A AGATTTTTGG CATCATTTAG CAGACGATTT 18 60 

AG T G AAAAC T GAAAAGCTGA CTGATACTAA GCAAGCATAA T T TAT AAAG T AAAG G G AG G A 192 0 

ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 198 0 

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAAA GAAATTCTAC 204 0 

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG C T T AC AAT AA 2100 

TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA CAATAATGTG 2160 

CAAGTTGGGG TGGGACGAlCG AAATAAATTT TGCGAAAATA TCATTTCTGT CCCACTCCCA 2220 

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AG AAT AT AT G TATAACAATA TTAAACACGC 22 8 0 

GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 2340 

T AAT AAT T AA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTCCAATAA TAAACCTAAA 24 00 

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA T G AAT AG TAG 2 4 60 

TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG CAAACAATGA 2520 

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAG AAT GAG AAAGCACTAA TCATTTGAAT 25 80 

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 2 64 0 

ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 27 00 

AACGCAATAA ACACAAAATA AGTAGCTACA GCTAATACAA TCATAATAAC TAAATAACGT 27 60 

T C T AAG AAA T T T AAT AC AC G TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 2 82 0 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 28 80 

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2 94 0 

AATAATAATA TTTTACTGTT GAATAGGTTA TAG T T AAT T T TAAAAATGATATAACTCAAA 300 0 

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 30 60 

TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 3120 

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA G TAT C TAT AG 3180 



GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 32 4 0 

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300 

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 3360 

GCACGTAAAT ATACGAGTTC AAGTCTAATC UTTTCATGG AATCCGTCCC ATCTCTTAAT 34 20 

TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 3480 

CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 354 0 

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTYTT 3600 

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 3660 

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 37 2 0 

TTCAAATTCC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 37 8 0 

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3840 

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TAT CAT C AAA 3900 

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960 

CAAGGT T AAA AATTTAGCAT CATGTGATTT TAGCCATTT ATTTGAGATT CAAATTGTGA 4 020 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4 08 0 

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 414 0 

TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA T G A T G AC CAT CCAGTGTGCT 4 200 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 4 260 

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4 32 0 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 4 38 0 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 44 40 

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4 500 

CTTACAATAT TTAGCGATGA TTTTAGCGTT GGGTATTCCC TCTGTCTGGG CTTGACCATG 4 5 60 

TTGCGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGAEAGA ACTACTGCTG 4 620 

CGTTAATAAT CCAGTATACT GTCGGATACC AAC T T AC AAA TATGAGTCCA GCCATATTCT 4 68 0 

TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 4740 

TCATAGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4 800 

AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 4 8 60 

TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 4 920 

TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 4980

AAAGACCTCC CAATGTTTCT GGAACCAJCA TCCAACACAT GGCAAGCGGT TCATACTTAA 5040

TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG AC AC C C G AAA 5160 

TAGTATTGAc TGCGCCAGCA AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAfiTTGCAT 52 20 

ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 528 0 

TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 5340 

GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 5 4 00 

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 54 60 

TATTTTCTTT GATTTTATAG ATGAGTTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 5520 

TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 5580 

TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTATGCC TTCTAATTCA TCCACATTTA 5 64 0 

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTGGT AAAATAGAAA TAAATTGAAC 5700 

CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 57 60 

ATTTCTTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 5820 

AATTTGCAAT AGATTGTTGT T AT AAT T AAA CGGAAATATT TGTAATTGCA ACTTAATTTT 588 0 

CCTGTAACAT AGTGTGATTA ATTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 5 94 0 

TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATGA CGGTACAACA CTTGATGATA 6000 

TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCGAAAAAAA 60 60 

GTATTTACGA ACAAAGTGTT AAATGTTGTT TTGATTACCT T AAT AAT ATT ATTATGATGA 6120 

ATCAAAATAA ATCGAACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 6180 

T C G AC AT C GA AGAAAGGTAT ATTAGAATGT ACGTTCAATT ACTAATACG CCTGAGGAAT 62 4 0 

TTTCTGGAAA TATTTACGGA CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 6300 

CAAAATTCTA TGATGAATCA AAGATAAAGA TGACAAAAGA AGACTTTCAG AAT TT AAT AT 6360 

TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 6420 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 6480 

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA G AC AT G AGAT 654 0 

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 6600 

AT AT G AAT T T TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 6660 

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 672 0 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 67 8 0 

GAAATCCGCA TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCGCC 6840

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 6900

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 6 9 60 

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 7 020 

AGAGCAAATA ATCAAGTAAT GATGATGCTT CTACTCGATT ATAAGAAAGC CATGATAGAG 7080 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 7140 

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AG AAT CAT G A TATAATTTTA 7200 

AACGATAAAA TATTTATATA AAT AAT TAG A (aAAATGTAG TTGTGTATGT yTTGTGGtCG 72 60 

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA AT G AAT AC a A TGATTGATTA 7 32 0 

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 7380

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 7430
(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60 

ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 120 

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 180 

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 24 0 

TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 300 

TATCATATCT AG T AT AAC AT GTGACACTAA TCCAATTATA ATTGCTGCCA TATAATAATT 3 60 

AGGCGTCTGA ATAATTTGCA ATAAAAATGC TATfATTGCG AT AAAT AAT A TTGAATGCGT 420 

AAAGGTTCGA TGTCCAAAGA T C AAT C T C AC AAAAAAACTA AT T AC C T T AA ACCTTCTGCC 4 80 

AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 54 0 

AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 600 

CATGCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT AC T C AC AT T T 6 60 

TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 720 

AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 780 

ACAAAATATT TAAAGGATGT GTTTGAATAA TGGCAATGAC AGTAAAAAAG GATAATAATG 840

AAGTGCGTAT TCAATGGAGA GTTGCTGATA TCAAAATTCC TACAAGTGAA ATTAAAAATA 900

TTACACAAGA CCAAGATATT CATGCAGTTC C T AAAT TAG A CAGCAAAGAT GTATCTAGAA 9 60 

TCGGCTCAAC GTTTGGTAAA ACGAATCGCG TTATTATCGA HCTGAAGAC CACGAATACA 102 0 

T TAT T TAT AC T CAAAAT GAT CAAAAGGTTT ACAATGAATT AAC T AAATAA ATTGTATAAA 1080 

AaAATCATTC ATGGTGAGGG CTTCATGAAT GATTTTTTTA AT T GAT T C AA CACCCAGCAT 114 0 

AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 1200

TTTATAACAA AAGATGTGCC AT C AAAG AAA TAATTGGTAG TGTAATGATT GTTCTAATCA 12 60 

AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACCAAGT ATGACGCCAC 1320 

CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 138 0 

TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 14 4 0 

TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT T AAC T C T AAA AATGGAACAA 1500 

AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 15 60 

CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGT TTTAATCCTG 162 0 

ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACCTAC TTCTGTTGCA GTATCAAAGC 168 0 

CATGTCTCAA TGCCGTCTTT CCTTCTGGCA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 17 4 0 

CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTG 1800 

CAACGAGGCA GGATACTATC AC T GAT AAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 18 60 

CTGTTTCAGC AACTACAATT GCAAAGGTGA T AG AAAC T AC ACTAAATGTC GTTGAAATAA 192 0 

CTGTTGCCTC AC G AC G AG AA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 198 0 

CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTCGAACGT CCTGGCAATG 20 4 0 

TAAATAAAGG TCTCATAACC GGTCTAAAAA TAGGACCCAA CATCTCTAAC AAACCGTATT 2100 

CCaTTaATAG CGGTAAAAAT AGAGCTGCaA ATAAGAATAC TGcCACTAAT GTTGGCAATA 2160 

AAC T T G AG AA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATOGTA 222 0 

AAAAT G T CAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280 

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA AT AG T T G AAC 234 0 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATT AAAAAT GGCATTACGC 2 4 00 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 24 60 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAAT AATAA GATACCAATT AAAGACATCG 2 520 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2580 

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAS'T CTATTCAGTT AACAAACATA 2 64 0 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2700

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760

TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820 

CGTAATAATA T TAAGAAGT A TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2 8 80 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2 94 0 

ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3000 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3060 

AATAT TACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 318 0 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGASTATC ATGTCCATAT 324 0 

AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3300 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCTTTAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 34 20 

AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 34 80 

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3 54 0 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3600 

AGTAATGTTC TAGGTATACG ATACTCSTTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3 6 60 

TTAATAGCAT CAATCGGATG AATCAT GACA GACCCTACAC ATAAACTTAT GAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 37 8 0 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT G3LGTAACAA 38 4 0 

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3 900 

ACAAATCTGC ACCTATAATA AATGTTAATG GAAT CATGAC TAAATAATTC TTACTCACAT 3 9 60 

AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 4020

CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 4080

TT 4082 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:

TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 60

TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT T G T T AT AG AA G AAA T G AAAC 120 

CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC ATAAGATGCA 180 

TGCCAATTAA TTCTTTGAAA AC GAAC AAGG CGACTTCTAT CTGaGTATGA TAGAAATCGC 24 0 

CTTGTTTATT TTTAATCTTC ATCTAAAAAG TCTTTAATAG CTTGTTTATT TGTGTTTTA 300 

TTAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTTCTTGATT 3 60 

GGCaCAGTCA ATGACTTAAC ATCTTTTTCA CCTCGGATAC CaAAACTCAA ACCTGTTTGG 4 20 

AAAATCCCTG AATCAGGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 4 80 

GGTAATTTAA CAACTGTTCT AAAAT TAACC ATTTCTTTTT TCAATGTTTG CATCACTTGT 54 0 

TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT TGCATAACCA 600 

AGTAATTCTT TACCATTCAA CCTATGGTTA CCCTTTTTCA AAG AT AC AC C AATATTTTTC 660 

G AC AT AT C T T TTTCGACATT AATTGGTACA CCTTCAGCA TTAATTCATC AATCATTTTC 720 

TCAAATCCAG T AAAAT C AAC TACTGCATAA TATTCAGGAT TAATTCCTAA ATTTTTATCA 780 

AGTGTTTTTC TAAGTAGCTC TGGACCACCT AAAGCGTATG CTGAATTAAT TTTGTGTTTT 840 

CCATATCCTG GAATATCTGC AT AAAT AT C A CGCATGACAG ACATCATTTT CATCTTTTTA 900 

TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC ACCTTGTGCT 960 

TTATCTGCAC CGAGTACAAG AATAGAAATT n T AC CAT CAT TTTTTACTGG TCC ATT AAAT 102 0 

TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACCGCT ATTGTAACTA 1080 

TGTACAATAT ATACAACTAA TGCCGATAAG TAAAATTACA ACAATCAGAA GAATGATAGG 1140

TAATT 1145



(2) INFORMATION FOR SEQ ID NO: 263:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

TATGGCTCAT CAT T AAT GCA CGTATCGGGT AGCGTTTACC ATTGATAAGT GCTTCATGTT 60 

TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 120 

TGTTTTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT TGTACTTCGC 180 

GC AC T T G AAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 24 0 

AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 300

GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT AGGTCGAACT 360

GATAACCTTC AG AT AAAAG T GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 4 20 

AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 4 80 

TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 54 0 

TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 600 

CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 660 

GGACTAAATC AAACTCGTAA ACTGCATTCA T T AAAAAG AT TGTTGCSCT GGAGCAAGTT 720 

GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 78 0 

TAAACTGAGT T AC AG C AAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA ACAACTGCCA 84 0 

TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 900 

ACCATGTTGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 960 

TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 1020 

ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG AGCTAGTTGT 1080 

TGACCATAAA AACTAAAATC TTCCCCAAA r AAGAATGGCG TTGGTTTGTC ATAGACATTT 1140 

AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 12 00 

GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 12 60 

TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CAT C AAT AT C ATAGTACGA 1320 

ATAGTACCTT CTAAATAGCC ATTACTTGGT AC AG T G T T AA TCGCTTCACC AGCTTTAAAA 1380 

TGACCAATAT G AAC AAT AT T TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTGTGAT 144 0 

ATTTGAGTTA ATACATGTTG TAATGCTTCA CCACAAGAAT GACCTTGTTC TTTATCAGCA 1500 

ACATGACTTG ACAGGCCTGT TAAG AAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 1560 

TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT T AAC AT G AAT ACCAAATACC 1620 

GCTTCAATTG GATACTTATC AAAGGCACCG GCTTTTATTA ATCGATTTGC ACCGCCACCA 168 0 

GTTTCTTCTG CAGGTTGGAA AATGAAAACG ACATTTGCG GTAATTGACC TGCATCTTGC 17 4 0 

ATGTCTTTGC AACGTTGTAC AAAAAGC AT T AATGCAGTTG TATGACCATC ATGTCCACAA 1800 

GCATGCATCA CATGATCAGA TTGACTGCGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 18 60 

AACGCATCAA TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 1920 

ATGACGCCAG TTTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 1980 

ATATAAGCAG TTGTTTCAAA TTCATGTAAG C T T AAT T C AG GATGTTGATG TAAATGACGG 204 0 

CGATGTTTCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 2100

ATATAAATAG TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 2160

TAAGGTTGAG ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 2 22 0 

ATTCAGTTTT CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 2280 

CTTAATCACT TTTGCAGGTG TACCAGCAAC AACTGCACCA GCEGTACAT CTTGTGTCAC 234 0 

AATCGCGCCA GCTGcAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 2 4 00 

ATTTGCACCG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGTTCAAT 2 4 60 

CACACCTGCT AATACTGCGC CAGCCCcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 2 52 0 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2 580 

TGTTGCGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 2 64 0 

ACCTGGTTCA ATTCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 2700 

ACGATCCATT TCAATTTCTA TATCTTCGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 27 60 

CCAATCATCC GCTTCACAAA AG AT T AC T T T AGATTGTTCT GAACCAAATA CTTTAAAACT 2 820 

TTCTGGATAT GTGATGCCTT CAAAATTACC ATTTAAATAT ACTTTTATTG GTGTAGACTT 2 880 

TTTAGCATCA CTTATATATT GAATAATTTC TTCAGCTGTT AAATGTTGTACCATAAAATA 2 94 0 

ATCGATCTCC TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 3000 

ACTAAGCGTT CTGCTGCTTG TATTGCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 3 0 60 

TGCGTGATTT GAATCGTTTC ATCAGT GCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 3120 

GTACCTCCAC GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 3180 

TGTCTATCAT ACACAGGTGT TACATTTTCT TTCAAAGATA CGATCACATC ATACAATTTT 324 0 

TCTAACGTAC CACTTGGTGC ATCTACTTTT TTATTATGAT GTGCCTCAGT CAATTCGATG 3300 

TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 3360 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 3420 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 34 8 0 

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTG CAACATCGGC ACCTTTASA 354 0 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 3 600 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3 660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 3720 

GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 37 8 0 

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 38 4 0 

C T GAT AAC AG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3 900 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3960

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4020

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4080 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 4140

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4200 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4 2 60 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4 320 

CATTAGTGCC AGTTCCTGCT AT G AC AG G AA CACGTTTATC TACAAGATCA ATAACTGTTT 4 38 0 

TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 44 4 0 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4 500 

TATTAACTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC sCCAACECC TCAAATAAAT 4 5 60 

GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CCAATACTTG TACAGCATTT 4 62 0 

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4 68 0 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 474 0 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4 8 60 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4 920 

AAAATTTTTC TCGTCTCATC AATCATTTH TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4 98 0 

AAC AC AT C AA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 504 0 

TTACCATTTA CACCTTCAGC TAAATCTTTC TTACCTTTCA TCCCTGAACC TGATACAGCT 5100 

TGATATGTTG TATATGCCAC TCGTTTTAAA CCATAAGCAT CTTGCAATAC TTTAGAGGT 5160 

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 5220 

TCATTGACTT C C GG AAC GAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 52 8 0 

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 534 0 

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAG CATC AG T TAATTCTTGA 54 00 

ACTGTATATG TTTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 54 60 

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 5520 

ACTAATCCTG TTGCACCCAC AACTGCTAAC TTTGT&TAA CTTGTCACTC CATTTTATAA 5 580 

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 564 0 

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAAT CAT C AA T G AC G T AT G A AATACTTATT 57 00 

TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 5760

GATGCCACAC CTGACATATC ACGCATGCCT GAGCCAATTA ATGAAATTTT GACATAATGC 5820

TCATTGATTT TATAAGCTAA TGCTTCATAT TGATTCTTTA ATGTTTCAAG AATCATAGAA 5880 

AT T T G AT G AA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 5 94 0 

ATTTGTGAAA TCATATGSAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 6000 

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 60 60 

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6120 

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TA2LGTTTTT AGCTAATTCA 618 0 

ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 624 0 

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 630 0 

TCAACGTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 6360 

GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 64 2 0 

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 64 8 0 

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCGCTTG ATAGCCAGTC 654 0 

ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6 600 

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGGTGCTTT AGTCAAGGTT 6660 

GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 67 2 0 

TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTATTTTTGTAAA 67 8 0 

ATCACTGACG GATGATCCGC CAAATTTCAA CACACTTCTT GTTACCATAT AATCCTCCTA 684 0 

ATTGTTAGGA AGGGcACACA ATAATAAAAA AAGAACAAGT TCGGAATGCG AACTTGCTCT 6900 

TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA AT G AC AAAC T CTCAGCTCTT 69 60 

AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 7020

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 60 

TGAACCTGAT TGACTAGTAC TTTGTGATAA TGACTTACTA T C AGAAT C AG ATGTACTTTG 120 

TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 180

GGACATTGAT TTACTGTCTG ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 240

TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGTGCTAT 300 

GCTTAGTGAT C C AG AAAC AG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 3 60 

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 420 

GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 4 80 

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACTAATTGCA TTCGATGTAC TGTCACTTAA 54 0 

TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 600 

GCTTCCTGAT GTACTCATAC TTAATGAGTC AC T GAG T GAT GTTGATGTAC GCAATGAATC 660 

AGATGTACTT G T T GAT AG AC TTTCGGATTT TTCACTACTG CTAGAGTTTG AAAT AGAAT C 720 

GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 78 0 

CACTAAACTT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 84 0 

AGATGCACTC AC AG AAT C AG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 900 

AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 9 60 

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATGCTATT 1020 

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGCAT C T G TAG AAAC 1080 

GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 1140 

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCACCGCAT TTCCACTGTT 1200 

ATCTTGCGTA aATnTtTAAT CGGGGATATT GGTGAATACA CTTCTGATGA TTGATCTCCT 12 60 

ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TETAAAAGT TGTCGTCGAT 1320 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 1380 

GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 14 4 0 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 1500 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG GTGCTTTTAC ATCAGTAAAA 1560 

TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 1620 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 1680 

TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 17 4 0 

ATATCTTTAC CTGTTGTTAC ATCAACGTAT CTCACTTGTG TAnCAGCAGA CTCTGTATAT 1800 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 18 60 

GATAATGAAA AGTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATTACGTGTCCAT 192 0 

GTTTGACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 1980

TCTTGGaACG TGTTATTTGT AGGTTGaACA TTTAACTTCG CAGCATTATC AGCTGTTGAA 2040

CTTGATGTAT ACGTTGTCGC AACACCATAA CTATCTGTTG TTACAAATGC ACCAAACGCA 2100 

CCTCCACCAG C T AC AT T AGA TGGGTCAGCA TTCGCCTTTG CAGCTGAATT TGGTTTAGAT 2160 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 2220 

GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 22 8 0 

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AG AAAAAT G A 234 0 

AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 24 00 

TCCTGTGTTA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACGTTGTC 24 60 

ATATATTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT T AGCAGT T AC TGCGGTGTC 252 0 

GTCgCTGctG ACGCAAATGT TGACATAGCT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2 58 0 

GCGGTGCTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA CCGAAGATGG CGTTGTGCTT 2 64 0 

TGTGAAGTGT TATTTGATGC AGTACTTTGA TTTGTTGATG TATTAATTGG TTGTTCTGTA 27 00 

CTTGAAGTTG AAGC T AC AG A TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 27 60 

GATTCTGATG TAG AT G T C AA TTTCTCTTGT TGATTGCTTG T ACT ATT AG T TGTCGAAGTG 2820 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 28 80 

TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GATGTTGATG CTTCGATTGT CGTTGAGTTT 2 94 0 

TGATTACCTA CTGTTTCACT TTGTGTGTTT AATTCAGAAG TTAATGGTGC ATCAGAAGCC 3000 

GCAAAAGCTT GCTGGTCATG C AAC AT AT T T ACCGTGAATG CACCACCAAT AACCGCCGTA 30 60 

GTTTTCAGTC CGTATCCCGT CATTTTTTTA CTAATGCTTT GATTATCTTG ACTCACTAAA 3120 

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 3180 

TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 3240 

TCAT GAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 3300 

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 3360 

TCATTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 34 20 

TTTTTAAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT T T T T AT AT AA ATTTGGCTAA 3480 

TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATATT GTATTAATAT 354 0 

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3600 

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3660 

AATATATGTG TAATTCTAAA AGATTCTTCT TT AAAAAT AT AAATACCACG ACATATTGCT 37 2 0 

TTAACATTTT CATTTATAAA GCGAAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 37 8 0 

GATGCATTGA ATTCACTAAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3840

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3900

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3 960 

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4 02 0 

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4 08 0 

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGTTCTAT 414 0 

ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4200 

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 4 2 60 

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4 32 0 

TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 4 38 0 

TAAATTATCT TTCCCATCTT TCGATACAGT AGAGAT C C AC CCTATTGGAC GTGGTACTGT 44 4 0 

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4500 

ATT CAT ACT A TCTTCACCCC TTATAAGTAA TTACffTTAA GGTTACGCCC TCTTTACATA 4 560 

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA C TAT AT AT G A CGTGGTTGCT 4 62 0 

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 4 68 0 

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC AT GAT TAG TT TATAATAATA 4 74 0 

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4 8 00 

ATTTAACGGC GTTACAAGGC AATACTTGAA TTTCAGAACA GATAGTATCG ATGATCTGGA 4 8 60 

TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4 92 0 

GTTACAATCT TTGATTA^CA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4 980 

TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 504 0 

AAAATCTATA AAT G AC AT T A TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 510 0 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTAAAATCG GGATACTAAT 5160 

GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 60

GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 120

ATTATGTACG TACTTGGGCT T G TAG AT GAT ATCTACGATT TGAAACCGTA TATAAAATTG 180 

GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 24 0 

TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 300 

TGGATTGTTG C T AT T AC AAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 3 60 

GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 420 

TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTTACAAT 4 80 

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATSTTGG ATTTATCATC 54 0 

GGATTCCTTT CTTTACTCGG AT T C AAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 600 

GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 660 

GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 720 

TACACACATA G AC AAAC AG T ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 7 80 

AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 840 

TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 900 

TTAAATTTAA TTAGTCGTAA GTCATCTTAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 9 60 

TATGGGATAA TTTGTATTAT ATGGCTTTAC TCTTTACAAT TTTTTTGTAT TAAATTTCAA 102 0 

ATATAAAAAG CACTGCCATA AACGTGTACT TCAATTGTCG TTTAATAATA CGCAATTGAT 108 0 

ATTTACCGTC TTATGATAGT GCTTTTTATT TTTATTCAGT TGGTATATCG AAGGTAACT 114 0 

GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 1200 

TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 1260 

TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 1320 

CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 138 0 

AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA 14 4 0 

TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCGrGC TTTTTTaATA 1500 

TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGG"TCACC ATCATCATTC GCnTTTTGAA 15 60 

TATTCATTtC AGGTCCAATA GTATATGCAG AACAATTATG AGTGGCATCT TTATGTTCTT 162 0 

TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATGAG 168 0 

CAATGAATCT T GAT T TACT A ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 17 4 0 

TATTTTGTGT CATAATAACT CCTTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800 

CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT 18 60 

GTATCATTGC TATGAGTATA TTTTAGGAGG ACGACTATGA AAATTGCTGT GATGACCGAT 1920 
TCTACAAGTT ATCTG1CGCA GGACTTAATC GATAAATATA ATATTcAAAT AGCGCCATTA 1980

AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2 04 0 

GCCATTCAAA CAT T AG AAAA GAAAGTATTA GATATTGTAA AAGACTTTGA AGAAGTAACT 2100 

TTATTTGTCA TAAAT GGAGA TCATTTCGAA GATGGTCAAG GTTATACAA AAAGTTACAA 2160 

GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2220 

CATTTAGGTT CTGGTGGATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 2280 

TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 2340

CTTTAAATGT TCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 2 4 00 

TTAGGCTCTT TTTTACGTTG AAATGAGGTT TTAAGCATTA AACATTACGG GAAATTAATT 24 60 

CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520 

CAAAATTCAA ACT AT TAT TG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2 58 0 

TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAAT TGT AT ATCACTGGGT 2 64 0 

CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 27 00 

TAT TAT CAT C TCTCATTTGA ATTGTCGGAA CAGCAGTCTT ATGCCTCAB ACATATTGTT 2 7 60 

CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2820 

GAAATGATGT TTCAAGGCAT TCAATATGCA AGAATACAGG GAGATAATAT AGCTATTGTG 2880 

TCACCACGTG TAGATGTTGT T G TAG AAAT T AGTAAACGTA TTAAAGACGC ATTTCTTAAT 2 94 0 

GAAGATATAG AC AT ACT AC A CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 3000 

TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG ATACTATTTT TAT T GAT G AA 3060 

GTCGATGCCT TTCCTTTATC AATGGATAAA AATTTACAAC AAGCATTGAA GTCATCTTCT 3120 

AAAGT TGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 3180 

ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 324 0 

CCTAAATATC GTTATTTCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 3300 

TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAG&ACA 3360 

ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 3420 

GAGGATGTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGTCATT 34 80 

T T T AC T AC G A CAATATTAGA ACGTGGATTT ACAATGGCAA ATTTGGATGT TGTTGTTATC 354 0 

GATGCACATC AATATACTCA AGAGGCTTTA AT AC AAAT T G CTGGACGTG 358 9 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 60 

T AT C G AC AAA GCAGTTAAAG TTGCTGTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 120 

AAATAAAAAT GAAATTGCGC AAGTAGGTGC GATTCAGCA GCAGATGAAG AAATTGGACG 180 

TTATATTTCT GAAGCTATGG AAAAAGTAGG TAACGATGGT GTCAT TACAA TTGAAGAATC 24 0 

AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAATTTGATC GTGGTTATCA 300 

ATCACCGTAT ATGGTTACTG ATT C AG AT AA AATGGTTGCT GAATTAGAAC GCCCATACA 3 60 

T T TAG T AAC A GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 4 20 

GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGCG ATGCATTAAC 4 80 

AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 54 0 

TTTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 60 0 

GATTACTGAT GATTTAGGCT T AGAT T T AAA AGATGCATCA ATTGATATGT TAGGTACTGC 660 

AAGTAAAGTA GAAGTAACTA AAGATAATAC CACTGTTGTT GATGGTGACG GTGACGAAAA 720 

CAGCATTGAT GCACGTGTTA GCCAATTGAA AT C T C AAAT T GAAG AAAC T G AATCTGACTT 780 

TGATCGTGAA AAAT TACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 84 0 

AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AG AAC G T AAA TTACGTATTG AAGATGCATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 960

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT G AC AT T G AAA CAGGTGT 1017 
(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

TTAATCCAGC GTTAACTGTA TTTGCATTTA T T AT GAT TAT TTCGATTCTT TTAGCGTATG 60 

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 12 0 

TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 18 0 

GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 24 0 
TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 300

TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAfiATGTCA TTTTTACAAA 360

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 420

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 480

TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 540

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 600

TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 660

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 720

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 840

GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 900

AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 960

CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 1020

TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 1080

TAGGTCTGTT TGACCGAGAC ATAGTTGTTT GTGCTACGAA TGACGATGAT ATTAACCGAA 1140

AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 1200

CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 1320

TTGAAACGTC ACTATATGAA ATTCAAATGT TAATTATAA ATATGAAAAT ATTCAATTAC 1380

GTAATTTCCC ATTCGGAGGA GACATCATC 1409

(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4702 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

AAAGAGGGTT TTTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCTTTGGC 60 

CCGTTCATAA CCATAGGAAA ATATATGCnG nAATCATTGA TAGTAAATAC ATSATAATA 12 0 

CGAATAATGT CGGTGAAATC ACGTCCGTAT TACCATTATT AGC T AAAAC A TTTTCCAAGT 180 

TTTCTTTTGA ACCGGATACT CATAAATGCT TTTAATGCnT GGTTTTGTCT GTCGCCATCT 24 0 
TTAGAATTTG TAATACTTTT TTAAATTCTT TAGCGAACAA CTCATTATCT TTATCGTTTT 300

TAGCCATTTG AC GAT T C AAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 3 60 

GCTTATTCAC ATTATTGTCG ACTTGATTTA ATTGGCCACT AACAGAATCT GCAATTGATT 420 

TTGATTCTTG TGTATCTGAT AGCAATTGCG TACTCTTTTC AGAAATTCTA CTAATTTCTT 4 80 

TATCTAAATT TGAAGACATC GTATTAAATT CATCATTTT GCCTTTATCA ATTTTTGGTT 54 0 

CTTGTGGCTC TTCAGCAAAA GTCTTTTTAA CGTTTTCTAA CTGATCAATC AGCTTGGAAA 600 

TATCTTCTTk ATTTTTTGTT gTATTCTTTT TGTTATTTAA AATGTCATCA ATCAGTTTGT 660 

CTGAGTTTTC TTCCATTGAA TCAATTTGaT GTAACACAGC TACTTTATCG TCTTTGAAAC 72 0 

TTTCCATGTC ATTGATAACT TGGTCAACCA T CAT AT C AAT TAAACGTTTG TTGTCGAATG 780 

GTTTATCTTC tCTGCCTTTT GTATCTGTGT ACATTTTATA ATGCGCATCA AACCTAGATA 84 0 

ATGCACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG ACCATCaAGG TTATGATTCA 900 

ATGTTACATC CACAACACTC GTTGCTTTCT TATCATTTGG TTCATCTTGA CGATTTGCTT 960 

GTCCAAATAA CAATTGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 1020 

TTTTCAATTT AGCAACGCCA TTGACTTCAA CTTTATATTC CTTGTTAGAT GTATCGAGTT 1080 

GAACACTTTG ATCCTTAATG TCATATTTTT TACCATTAAT TTAATGTCG CCTTCAAAAT 114 0 

TAAAATGAGG AtCTGTTGCA ACAGTTAATT GATTAATATC ATTACTTTTA ATCGTTTCAG 1200 

TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTGT GTCTTGCGCA ACTTGGTTAT 12 60 

CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTTTTT CAAATTATAT TCATTTTTAT 132 0 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 1380 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 14 4 0 

T G AC AAT AT C ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 1500 

TAATGCTATT TAGCGCATCT TGTMGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 15 60 

CTGTTAATTG C T T AC GAT An TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 1620 

GAATGATTTC TCTTAATTTA. TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCTTTAC 1680 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTTAATGCATTGA 1740 

TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 1800 

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC G AAAC AT TAT 18 60 

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA T AAT GAT T T A TTGTATGTTT 1920 

GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980 

ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 204 0 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 
TTGTTAAATA AATTTCGACT AAGTTTTTGT TLAAGTCGTT AAGTACATTA CTTACAACTT 2160

TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TGTTTTATAC TGTAGCGATA 2220 

TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 22 8 0 

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTGTKATG 234 0 

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 2400 

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 24 60 

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2520 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 2 58 0 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2 64 0 

AAAGCCACAT . TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 2700 

AT T AG AAAG T TGTTGGTCTT GTTCTTGAAC GGCATCAGa GTGCtATTcA ATTGTTGTTT 27 60 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2820 

TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 28 8 0 

TAAATCAGAT AAAATTTGAC GGATTTGGTC TGAACCTTGC CCGTAAGATT GCGATTTTGC 2 94 0 

TCTGATTTCC TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 3000 

AAGTTTATCa AAACTTTTTA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3060 

TATATAAATT ATTAGTATAT GTATATAGTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 312 0 

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTGTAAT ATAAATGTGT 3180 

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 32 4 0 

ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 3300 

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAAGCAC AGTTCAAAGT 33 60 

GAATAAAGAC ACCGGCGCGT ATACTTATAC AT AT G AC AAA AACAACACGC CAAACAACAA 34 2 0 

TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 34 8 0 

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 354 0 

TTCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCGAATGT 3 600 

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3660 

AGGCATCGAT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTTA ATGACAAAGC 37 2 0 

AAAGACCGCT GACGGTAAAC CTTTAGCiTT AGGTAACGGT AAAAT CAT T G ATCAGCCTCT 37 8 0 

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 384 0 

ACGTGCCAAA GATGGACACA C GAT TAG T AC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3 90 0 
CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTACA 3960

AACAGTAAAT GGTCCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA TTGATGGAAG 4 020 

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 4 080 

TGCTTCAGAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 414 0 

GCGACCAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 4 2 00 

TTATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 4 2 60 

TACGAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTATCGTC 4 32 0 

TTGAATAAGA ATATACCGTA TTTTTTCTTG CGATAATAT CCAGTACTAG CCAAAT T AAA 4380 

ATGATTATAA C AC C AAC AAA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 4 4 40 

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAG TAG 4 500 

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4 5 60 

TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 4 620 

ATTGCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 4 680 

TCATGATCAA CAAAGCGAGC GC 4702
(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

ACAAAAAATT CACCCTCATT AAT AT T G AAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 60 

TAACGCTTAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 120 

AAAAC C G AG A TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 180 

TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATfflA TAAwAGGwAA TCwCCAGTTA 2 40 

AG AAA T AG T G TTATTTTACC TTGAATTCAA AAAAAC AC C C AG T AAAAC AA GGAATGCTTA 300 

CTAGGTGTCT T C AC TAT AC T TTGGCTTTAT AATTTTGAAT CGTTtCTAAA AATGCTGGAC 360 

AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 420 

CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAAT CAGA AAA TAT AC TT ACAGGTGCTA 4 80 

TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 54 0 

CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 600 

T G AAT AC T T C TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 660 
CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 720

ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 780 

TATAATCTGk TGACTCTCCT CTTAACACTT GAATGATAAC ACTAXACTC TCTTGTTGTT 84 0 

T CAT AC G AGC GATGCAACTA AT AAT CAT C T TAGCTTCTTG TGTCATATTA TATGATTTAT 900 

CTTGTTGAAC AC AAT T AC T A CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 960 

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATTTTAGTTA 1020 

ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 1080 

CGTGTAAATT GATAtCGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCCGTCAC 114 0 

GACCTGCACG ACCCGCTTCT TGATAATAAG ATTCTAAATC TCCAGGCATA T TAT AAT G AA 1200 

TAACAAAGCG TACATTGGAT TTATCASTAC CCATACCAAA AGCATTTGTA GCAACGACTA 12 60 

CTTTAACACG ATCAAATAAG AAATCATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 1320 

CTGCATGATA TATAACACTT TCAATTTTCT GACTTTCTAA GGCTTCTTGA AGCTCTTCAA 1380 

CTTGCTTACG T G T AG AAC AA TAAATAATAC CTGCATCTTC ATCGTGTGTT TSATATAAT 14 4 0 

C C AAT AT AAA TTTTTGACGT T G AT AAG TAG GATTTACTTT AAAAATTAAG TTTCTACGCT 1500 

TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 1560 

GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 162 0 

ATACTTTTGA AAT AAC AT T T TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 168 0 

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 1740 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 1800 

CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCA&CTACT ATTTAAAAAA GCAGCTTGAA 18 60 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 1920 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC ACAT 2004 



(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:
AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTCAAT TGCCACATTT T TAT C AAG AG 60 
TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 120

GATTTTCTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT CCAAGTTATA 180 

CGTATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 24 0 

CGTAATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 300 

TGTATTATTA CTTTGCTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 3 60 

GAT TACGC AC ATTTAAATTT AG AAC AT T T T TTTGCAGAA ACGACGATTT AGATGTTATA 420 

AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 4 80 

GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 54 0 

GCAAGTTTTA TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 600 

TGTTTTTTAA TACGTGAATA ATAAGATTAT T G AT AT AAAG GTTTTCAAAG GTTATACAAA 6 60 

AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 7 20 

CGTTTCATTT ACATTACTGT TTATTTATGA ATATGTAACA ATGCATAGAT AAAATTGTTA 7 80 

AACGGTTTTA ATAATGTTAA ACTTTATTAT CGTAGTCAAA CTsAATGTAT AACAACAATG 8 40 

ACCTAAGAGG TGTGGATATG AATAAACACA AGAAAGGTTC TATTTTTGGA ATAATAGGAC 900 

TTGTTGTCAT ATTTGCTGTT GTCyCaTTTT TATTTTTCTC AATGATATCC GATCAGATAT 9 60 

TTTTCAAACA TGTTAAATCC G AC AT T AAG A TTGAAAAGTT AAAETTACA TTAAACGATG 102 0 

CAGCAAAGAA ACAAATAAAT AATTATACGA GTCAACAGGT AT C AAAT AAA AAGAATGATG 1080 

CATGGAGAGA TGCATCTGCA ACTGAAATTA AAAGTGCAAT GGATAGCGGT ACTTTTATCG 114 0 

ATAATGAAAA GCAAAAATAT CAATTTTTAG ATT TAT C AAA GTATCAAGGG ATTGATAAAA 12 00 

ATAGAATTAA ACGTATGTTA G TAG AT AG AC CAACGTTATT GAAACATACG GATGATTTCT 12 60 

TAAAAGCTGC TAAAGATAAG CACGTTAACG AAGTTTATTT AATTTCACAT GCATTATTAG 1320 

AAACTGGCGC AGTTAAAAGT GAATTAGCTA ATGGAGTCGA AATTGATGGC AAAAAG TACT 138 0 

ACAATTTCTA TGGAGTAGGA GCCCT'EATA AAGACCCAAT TAAAACAGGT GCAGAATATG 14 4 0 

C T AAAAAGC A TGGTTGGGAT AC AC C T G AAA AAGCTATTTC AGGCGGTGCT GATTTCATTC 1500 

ATAAGCACTT CTTATCAAGC AC AGAT C AAA ATACATTGTA TAGTATGAGA TGGAATCCAA 15 60 

AAAATCCAGG AG AAC AT C AA TATGCTACAG ATATTAAGTG GGCAGAAAGT ATGCAACAA 162 0 

TTATCGCTGA C T T T TAT AAG AACATGAAGA C T G AAG G AAA AT AC T T C AAA TACTTTGTGT 168 0 

ATAAAGATGA CAGTAAACAT TTGAATAAGT AATTTGATAA GCTACGAGTT GTTTTTATGA 17 4 0 

CTCGGACATA C T AAAAAG AC GCTTTCTATC TTGTTTTGAT AGAAAGCGTC TTTTTGCATT 1800 

AGAGAAAACA CATTGATkGA TAAtCCCaCC aATGCAAgTG GGGcAGGACa T C GAT AAAG A 18 60 

ATTACTTTTT C T T TAG AAAT TAGTATTTCT TATGCATGAG TTTTACTCAT GTATTCCTAT 1920 
TTTTAAGTAC ACATTAGTTA TAGCTAATGA TAAAGAACCA CTACATAATA AATCATTAGT 1980

GTTTTTTTAT CATTTCTGTC CCaCTCTCAT CCEATTTGA AATTTTCAAT TGCGATTTTA 2 04 0 

ATTTCATCTC TTACACGTTG GAACTCTGAC CAAGGCTTGC CTGCAGGATC ATCAAATCCC 2100 

CAATGTTCTT TCTTAACATT TGTTGGTAAA GAAGGGCAAT TTACGTCTGC ATCACTACAT 2160 

AATGTAACAA CTAAATTTGA ATTTnTAATA ATATTATTAT CGGATTAAAA TCTGATGGA 222 0 

GATTTGATAT ATCAATGCCT ACTT 22 4 4 
(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60 

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAAT AT TAT G 120 

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 18 0 

T CAT C AC AAA TGATGAATTA ACGACTGCAC AATCCAAATC AC T AAAT G AA GCTTTAGGTG 24 0 

T AAAAAT TAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 300 

AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTT&TTA TTTATTACCT AGATTGCAAG 360 

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 4 20 
CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 480

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 54 0 

TTCAAGTAGC TTTAGTTGGT TAT AC AAAT G CTGGTAAATC ATCATGGTTT AATGTTTTAG 600 

CAAATGAAGA GACGTATGAA AAAG AT C AAT TATTTGCAAC GTTAGATCCT AAAAC AC G AC 660 

AAAT TCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT T T T AT AC AGA 720 

AACTACCTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 780 

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TAT G AC AC AG 84 0 

TTAATGATTT AATCAAACAA T TAG AT AT G A GTCATATTTC TCAAAT AGTT ATTTTTAATA 900 

AAAAGG AC T T ATGTGATCAT G C AT C AAAT C GTCCAGCAAG TGfflTTGCCT AATGTTTTTG 960 

TTTCTTCTAA AAATGATGGT GAT AAAT T AC TTGTTAAGAC GTTATTTATT GAT G AAAT C A 1020 

AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 108 0 
TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 1140

GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 12 00 

TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAAT TGAA GAAACAGCAT 12 60 

ATATTAATCA AG AAAAAG T A TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 1320 

TACAAGGATC AAC AG GAT AC GGGTATGATG ACTTTGGACG TGATCATTTA G 1371 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

CATCATATGG TAaCCGAAtT GAAAtAtTTT TAATAATAAT CCCTTAAGCA TATCTGTATG 60 

CGTAAAcCTA TGCGTTTTAA TATTCTGAAG TTACTTAGTT CATCCTCAGT TTCATCCATT 120 

TGTTTAATAT AAATAATACA TCCAGCTGCT ACTAAAAATG CTAATCTAA AAATGATGTA 18 0 

ACAAATATTA GAATACCGTT AGTAGCATcG ACCTCTTTTT TCATTTCATC ATACGTGATG 24 0 

ACTTTGTCTC CAAACTGTTT TGCAATTGCT TGAGCTTTTT CCTTTTGTGA TGTTTGTTTA 300 

ATATCATATC CAT AAAAAG T ATGAACGTTA TTTTGTGTTT TCAACTGCTG ATACTTTTCA 360 

GGACTTACTT CGATGACAGG TGAGTTGAAG CTTAGATTTA AAGGATAAAC CTTACCTTTG 4 20 

TCTTCTTGTG TAcACGGAAA GTTTCATTCT TAGTACCTTT TACTACTAAA TCTTTGTTTA 4 80 

AATGGATATT AATAATGTTA GGCAGCGATT TTGTATTTGT AATGATGGCA TTGTTGCCTG 54 0 

TtAACTTGTT ATTTGCACTT AAAATAGZAT TCGTGCGACC TGAATCACTA CCATTTTCCA 600 

AAGTAATAAC CTGATCATTA ACATTATCTA CAGTAATAGT TTCGTAAGCA TTTTTAGAAA 6 60 

ATGTAATTTG TTGTTGGCTT AGTTTAGTTT CAAATTGTTT AGCATCTTGA GTAGCGACCA 720 

CGTTAAATTC ATTTGGTGCC ATAGATGTAA GGGTTTGATC TGTATTTGAT TTfiATAATG 780 

CCGCAAAACA CAATACAGTT ACTGTAACTG CAGAAATAAT TGCAATGATA G T T AAAGAC A 84 0 

TGGCATTTTT CTTCATTCTG TACATAATAG ACGATGTGAA TACAACATCG GTAATAGATA 900 

CGCGTCCATT TTTTGATTTT TTCAATGTTT TAAAAATAAG TGACACGGAA CTTCTGAAGA 9 60 

ATAAATAGGC GCCTACAACC GTTAAAAATA AAAT GAT AAA CGGTGATGTC ATAGCCATAG 102 0 

TTAGTGCTTT GAACGTACCA AACATTTCTG TCGCCATATA ATAGCCTAGT GCAATCATAG 10 8 0 

CAATACCTAA TACGCCTGAA ATAACCTCTG CAGTCGTTAC TTTAGCAGTT GTGGCATCAG 114 0 

TTTTAATTGA ATCTTTCATC ATTGATAAGA TACTiCGTCT TTTT AGAAAT AAAGCACTTT 1200 






GAAATAAAAT CAGTACATAC GCAATAATTA GC AT GAAAAT AG T T AAAAC A AGGGCCATAG 12 60 

GTTCGAAATG TATCGATAAG TTAATCGATA AC G AC AT C AA TTTAGATACT ATGGAAAGCA 1320 

ATAaTTGTGC ACCTGCAATG CCACATAATA CACCGACAAC ACCTGTGATT AAAAAT AC G A 138 0 

TCATTtGTTC AAGTGCTAAC ATTTnCAAAA TGTTTTGTCG TGTTAAACCA ATCaACTGAA 14 4 0 

ATAGCGCAAA TTCACGTGTA CGGCGTTTTA CGrmTAAATG ATTGGCATAC ATTAAAAAGA 1500 

TGACAATAAT GATAAATAAA AATATTGATC CGACTAAAGC ACCTTTCTTA ATGATGGCCA 1560 

TCGAGTCGTC ATTATTTACA CCTTTAGTAA ACTGTAAGGT TGTAAAACTG AAATATAAGA 1620 

CGATGCTAAA AAATAATGAA AATAAATACA TTGCATAATG TTTTAAGTTT TGTCGTAAGT 168 0 

TTTTG AAAAC GATATGGTTA AATGTCATTT GAGACACCAC CTAATACTGA TTGAAGATGT 17 4 0 

ACAATGTCTT CATAAAAGGC CTGTTTAGAA CGTCCTTCCT GSAAAGTTG TGTATGAATT 1800 

TGGCCATCTT TTAGCATGAT GACACGTTCT GCAAAACTTG CTGCAACCGG ATCATGTGTA 18 60 

AC CAT G AC AA TAGTTGTATC AAACGAT T T A TTCATTTCTT CCAAACGTTG TAATAGGTCA 192 0 

TTTGCACTTT TCGAGTCGAG TGCGCCTGTT GGCTCATCTG CAAATATGAT TTGTGGTTTG 198 0 

TGAACAAATG CTCTCGCTGC TGCAGTTCTT TGTTGTTGAC CACCAGATAA TTCGCTAGGG 2040 

TATTTATTTC CTAGGTCATA AATACCTAAT GCTGTCGTGA TCGCTTTATA ATTTTCTTCC 2100 

ATTGTTGCCT TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 2160 

AATGTCGGCA GAATACTGTA ATCTTGGAAG ATGAAACCTA ATGATTCTTT GCGGAATTTG 222 0 

GCAAGTGCTT TTTGATTAAG TTTATTAAGC TCTTGTCCGT TAGCAATCcg cTACCGCTAG 22 80 

AAATTTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 234 0 

CCATAATCGC AACGAATTCG CCTTTTTGTA TGTCAAAGTT AATATCTTTAAGTGCTTGAA 2400 

ATGTGTGCTT TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 24 60 

ACTCCTTtTG TATTTAATTT CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTATCTAAAC 2520 

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 2580 

AGTGAGGATA C AG AT T CAT C G T AC AT AT T A GACAAAAGCA ATGGTGCTTT CTAAGTGATG 2 64 0 

ATGTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 2 7 00 

TTAAAAGAGG GTTTTATTTG AAAGGAATTG ATTGTTATGG AAAAAGGAAA TCAAGGTATT 27 60 

AAATGGTCTA GTTTAATAAT GGGTGTATTA TATTAATGT TGGCAGTCGT TATTTTTACA 2820 

TTTCCAATTG AAAATTTTTA TGCTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 2880 

GGTGTGATTC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 2 94 0 

TGGATTCTGT TTATGGGGAT TGTAGATATT CTATTTGGTC TATTAGTTAT TTTTAAOTT 3 000 
GGCGCAAGTT CAGCATTCTT TATTTATATG TTTGCTTTTT GGTTTATTTT TAGTTCTATC 3060

TCTGGATTAT TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GATTTTTAAT 3120 

TTATTAGGTA TTGTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 3180 

ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 3240 

CTTGCTTAAG TAAAATGAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 3 300 

CCTTTTTGTT GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 3360 

CCCATATTTT TATATTTTAC TATTATTGTT TTTCAATAG ATTAATAGTT ACATTTAGTC 3420 

CAAAATATTT TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 3480 

TACCATTCCT TGTTAAAGTT AAATGATTTT C AG AC AT T G T AGCACGGCCA AATGATTGTG 3 54 0 

GCATTGTAAT TAATAAATGC TGTACAAATA TTGAATCTGG ATGCGTTTGA TTATATTCGA 3600 

TGTTGCTATT AAAATCTTCT ATTTGTTTAG GTTTGAATTC aGCTTCaTAT TTTGTATGCC 3660 

AATGATCATT TTCGAATTTT TGAACATAGA AAATATCCTT GTCTTCGTTG T T AAAAAT AG 3720 

CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 3780 

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 3840 

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTGTGAATAG 3 900 

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAAT GT AT 3 9 60 

TTAATTCATA AC AAAAAC C A CCACGTTTAT CAT GAAC AAT TTTATXAAA AGTGCATCGA 4 020 

T AT T TAT AG A TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 4 080 

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA C GAG AT G AAT 414 0 

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4 200 

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4 2 60 

TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 4 32 0 

AC GAT T T AAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4 380 

AAATGCAACA ATACAAAGGT AGTTGCAA5A CAAGTTGCCT TATCTAGACC ATTTGTGTTC 4440 

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCCAA GTAATTAAAA ATAATGCAAC 4 500 

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4 560 

TAAATCTAAC TTATCAGCAA GAAT T T G TAG CAATTCAATC AT C C C AAT C A CTATGCTGC 4 620 

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTGCGAATAG GATTGAAGAA 4 680 

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AAC T CAT AC C 4 74 0 

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTGTGATGC 4 800 

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4 8 60 
AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4920

TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4 98 0 

ACGACGTAAT TTAGCGAACA AGTTAATTAA AG AG AT AAA ATAATTAGAT TCAACACACC 504 0 

GATAAGCACT AAAAAGAAAC C T GAAAC TAG TGTACCAATC GTTCCACCAA TATCTTGGAA 5100 

ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG C CAT T AAAAA 5160 

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAAC C C AC A CCAGATGGAT CTTTGCGTTG 5220 

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATGCATG 528 0 

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTTTCC 534 0 

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA AT G AC AAT AG CTATGTATGG 54 0 0 

TAACCAGCTT AAGCGCTCAT TTTTAAAAAC AG T C AAAAT A TATCCTTCTT TCATAACATG 54 60 

ATATTTAATC ATACTGTATG TTCAATGGGC ACTCTAGTAA TAAGTGTTCA TATAACAAAA 5 52 0 

ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 5580 

GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG AtETCACTTT GCTTAACTTG 5 64 0 

GCATCCTTTT TATGATTTTC AAATTCAAAA AAAT GAGCAA AATGAATCTC TTTAcCAGTT 5 7 00 

TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTGTGATGCT GGAATGGATA 57 60 

TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 5 820 

TATATGTCAA AGCGGCCAGG G AC AG AC C AA ATAAATTCTG TTGTAACCAG TGAACGTAAT 58 80 

AATTCAATAT CTAATGCTGC TGTAACAACT AT AAAAT C T A TCATTTGTTG ACGTTTAGGC 5 94 0 

GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA AT GATGACTG AACTTCCGTT 6000 

TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 6035 
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 60 

TTCACTTGTT TTAGCTATAT ATGATTATAT GAG T T T TAG A ATTTCTTCTG CTATTTTACA 120 

ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATGCTCA CACAATTTAC 180 

ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 24 0 






ATTAGCCATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CC TTAAATTC CAAATGAAAA 300 

AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 360 

GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 4 20 

AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 4 80 

TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 54 0 

ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 600 

AT CAT T G AAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 660 

AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 7 20 

AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA C&GTTTAAT 780 

CTGTTTCTCT TTAAAATAAT CACATAAATC GTCTGCACTT TTTGTGCCAG TATCCGTTAA 840 

CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 900 

TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATTTC 9 60 

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 102 0 

AAAGAATTAA TTCATATnT 1039 
(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA AC ACAAG AT G 60 

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 120 

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT 180 

TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 24 0 

AACAAATATG GAATTTAATT GAACCGTATA TTTCATAGTT TTTGTACGTC AAAAETATA 300 

CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT T G AT AAC AT T 360 

TTTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 420 

TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG T TAG TAT AT A 4 80 

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 540 

ACTAAGAGAG CAAGTTACTT T TAG AG AT G A TGCCTTACGA ATTGCCCATG AAGATTTATA 600 

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660 
ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTAAT TCAACGATTT CTAGCTCGTG 720

CATTTGGATT TGAAGTAGAT GAT AT T AAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 7 80 

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 840 

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900 

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGC AAAAAG AG C A CATAATGAAA 960 

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 1020 

TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080 

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 114 0 

TTGaAAATGC TATGAAATTA GCAGCAAAGC ATCArGTGAa AT AT GAT GAA CTATCAT CAT 12 00 

TACCAAATTT ACTTGAAAGT GCAGATATTG TGATTAGTTC AACGAGTGCA CAATCTTATA 1260 

TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGAAGAT TCACTAGTAT 1320 

TGATTGATAT TGCAGTTCCT CGAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 1380 

TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 1440 

TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 1496
(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:

CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC AGCTTCTAAA TCCTGCTTTG 60 

GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA TGATACAAGT GCAGCAAGGA 120 

AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC ATTTAAACAG TAAACTATAA 180 

ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 24 0 

CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATATGGGA 300 

AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 360 

TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAKCT GGTCCAGGGT 4 20 

TAG AT T C AAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 4 80 

GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA GGCTCTACTA 54 0 

AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 600 
TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 660

ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC GGCGTTCTAG 7 20 

GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 7 80 

CACCAAGTAA CATTGGTGTA TCAGGTAATS C AAC GAT G AC CCCCATAGTT GCTGTTGCTG 84 0 

CGATGATACC AC C AC G T AAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATG 900 

GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA GGTAACCATC 9 60 

CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTGGCA 102 0 

T GAT CAT AC T ACT T AAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTTTTTTCT 1080 

CTTCAGTTTG TGACATGTTT ACACCTCTTT TTTAATATAT GTGACTATAT ATAGTGTATA 114 0 

GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 12 00 

AATTAAATAA AGTGTCATCA AT G T G AC AG T AT AGAT AT T T TGAAAAAGTA AAACAAAAAA 12 60 

ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320 

gTACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 13 80 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATA'ATGA GTGGCCTTTA TAGAGTGCAA 14 4 0 

TATATGTATT TGTAGACGAG GAGGATAGTG AT C G AAT AG A TCGGCGGATG CTATCCCGGA 15 00 

TGTGGCTCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA 15 60 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 162 0 

TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTATGGA TGAATGCGAT GATAGCATGT 1680 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCGGCAATAA ATCATAATTT 17 4 0 

ACGCGTACTA TTCCAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATATTG 1800 

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGGTTTAGA AAAATTAGAA TACAGAGGTT 18 60 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 1920 

GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 198 0 

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAACTCT CATCCACATC 2 04 0 

AATCATCAAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 2100 

TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCG 2160 

T T C AAT TAG T TGAATACTTT TCAAATCAAG GACTTTCAAC TGAAGAAGCA TTTACAAAAG 2220 

TTGTGTCATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 22 8 0 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 2 34 0 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA G AAAT C CAT G 24 00 

ACCATGAAAT CGTTATTGTT AAAASVGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 2 4 60 
TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 2520

ATGCACACTA CATGTTAAAA G AAAT T CAT G AACAACCAGC AGTAATGCGT CGTATTATTC 2580 

AAGAATATCA AGATGCAGAA GGTAACTTGA AAAT T GAT C A AGACATCATCAATGATGTTA 2 64 0 

AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 27 00 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT G CAT C AG AG T 27 60 

TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2820 

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 28 8 0 

TAACAATCAC TAATGTTGCA GGTTCAACTT TATCACGTGA AGCAGACCAC ACATTGTTAT 2 94 0 

TACACGCGGG TCCTGAAATC GCAGTTGCAT CTACAAAAGC AT AT AC T GC A CAAATTGCAG 3000 

TATTATCAAT CTTGTCTCAA ATCGTTGCAA AKAGCaTGG TCGTGAAGCA GATATTGATT 3060 

TATTGAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 3120 

TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 3180 

GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA ACTTAAAGAA ATTTCTTt&A 324 0 

TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG TACAATTGCC TTAATCGAAG 3300 

AAGGTACACC AGTTGTAGGT TTAGCAACAC AAGAGAAAGT TAATTTATCA ATTCGTGGTA 3360 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTTCTATG GAGGGTCTTG 34 2 0 

AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA ATTGTTAACG CCATTAGTAT 34 80 

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAGATGTTG 3540 

AT AAACCAC G TAACCTTGCT AAAT C AG T T A CTGTGGAATA ATTCACTTTT TTAGAATCAA 3600 

TCATGTATTA AAATTAAAGT ATATGGCACC C T T T TAG AT T AAT C G AC TAG AAGGGTGCTT 3660 

TTTTAGGTCG ACTTaGCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 37 2 0 

GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TTTTATGAGA 37 8 0 

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 3840

GAAT GAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3900 

AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 3960 

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG TAAAATGCCA GGCAAGGTAG 4 020 

GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 4 08 0 

AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 414 0 

GTGTTTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA C T T AAAT TAG 4 2 00 

TTGGAATGTC TGCTAAAGCG CAACAATATA TTGGTTATTT ATCTACGGT GAAAAACAAC 4 2 60 
GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 4320

CAGCTGGTTT AG AC T T T AT T GCACGAGAAT CGTTGTTAAG TATACTTGAC TCATTGTCAG 4 380 

ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT TATTGAAGAA ATAACTGCTA 4 440 

ACTTTTCCAA AATTTTACTG C T AAAAG AT G GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 4 500 

ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4 5 60 

GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 4 620 

TGACAAAATT TAATTAAGAT AAAATGGAGV GTGGAGGGCA ATATGGATAA CGTAAAAGCA 4 680 

ATATTTTTGG ACATGGATGG AACAATTTTA CATGAAAATA ATCAAGCATC AACGTATACG 4 74 0 

AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4800 

TCGCATTCTG AAAATACATC AACTTG 4826
(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CAT TACCTTT 180 

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 24 0 

TATATATAAT ACTTACCCAC T GAAAAAT T A TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 3 60 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 4 20 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 4 80 

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 54 0 

CGTTACTGAG C CAT T AAT AT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTCA 600 

TTAATGAAGT C T T G AAT AT C TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 6 60 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 84 0 

TTATTATTAA AC T T AAT AT C TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 
ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCS CTTTTGGTAA TTCTTTTAAA 1020 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 12 60 

TGTGGTGCTG GTGAAGCACt AnCTGTTATT TTCTTGTCGT CACTGCCACT TTTGTTTCCA 1320 

TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 13 8 0 

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 14 4 0 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT T GAT AC T AAA AT G AC AT CAT 15 00 

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 15 60 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTG2LATG C C AG AC AC T A 162 0 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 1680 

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 17 4 0 

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CGCACCTGTC GGACCACTTA 1800 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 18 60 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA C GATAGAAT A AAATAACCTT GCACCGATAG 192 0 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 1980 

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 204 0 

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 2100 

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160 

AC AC T T GAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTGTATTT 2 22 0 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 22 8 0 

ATAGGCTCGG TTGTTGTTGA AC C AG C GAAA ACTAATCTAA CGATATATGC ATCTTTTTCT 234 0 

AATGGCTCTA ATTCTGTTAA AGATGTTTCG AAA T CAT CAT TTAAATCGTC TTTCACAAAT 24 00 

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 24 60 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2520 

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAG AT C T AAA 25 80 

ATTTCATCTG TTGTTTGCGG ATCAAGTGCA CTTGTGCCT CAT C AC AAAG CAAGACCGTT 2 64 0 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 2 7 00 
ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 2760

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 282 0 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880 

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 2 94 0 

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000 

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 30 60 

GGACTTCTTT ATTTTTAGTC CGATATTCTT TAACAACTTC TTTTAATTCA ATCACGTTGA 3120 

TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AG AAAAAT T G 3180 

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAACTTTA TGTGAATTGG 32 4 0 

CACCATTTCT AT AT AAG AC G GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300 
ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420 

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 34 8 0 

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 35 4 0 

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600 

TTAAATGTTT GTCTTCCTCA TAAAVTGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660 

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720 

GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 37 8 0 

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTTAAACCAATTT 384 0 

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT AT C ATACAGT 3900 

GAT T C TAT C A TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3 9 60 

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4 020 

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4 080 

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 414 0 

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4200 

TAAGTCAAAA ATCAACATCC TAAAATCTAC ^TGTTATAT TAACAATAGT TAACCAAAAG 4 2 60 

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4 320 

AAGTCAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4 380 

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAAKAAG 4 44 0 

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4 500 

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4 560 
GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4620

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4 680 

G G AC G AC G AA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4 74 0 

AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4800 

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTS ATTTCA 4846
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GAT AC AC C AA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 12 0 

AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180 

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 24 0 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 300 

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 360 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA T AAC CAT C AA 4 20 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 480 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 54 0 

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA (TGAAAAGCA AGCTGGACAA 600 

GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 6 60 

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 72 0 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780

TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840 

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 900 

CTTAATCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 9 60 

ACTTTTACAC CAGAAGCGCC AA2&ACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 102 0 

C CAT T C AAG A AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAG T AAC A 1080 

AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 1140 
GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATC GATTAATGAA 1200

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 12 60 

TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320 

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAG GAG AC 1380 

TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 14 4 0 

GCACCAGGGA C AG AAAAAG T AACAAGAGAA G G AC AAAAAG GTGAGAAGAC AATAACGACG 1500 

CCAACACTAA AAAATCCATT AACTGGAGAA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 15 60 

AT C AC AAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 1620 

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 168 0 

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 174 0 

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTOaGAA 1800 

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 1843
(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 60 

CTTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120 

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 180 

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 24 0 

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 30 0 

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC AT AG T C G AC G GCTACAATGG 3 60 

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 4 20 

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAG C AG T 4 80 

GGCAGTTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT AC AAGC AT T C GCAAAGTA&. 54 0 

AC AT G AAT AC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 600 

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 72 0 

T T TAT AT AC G TAAGACTTTA G T G AAT AT AT C TAT AAT TAT TTACTTGGTA AGCTGGTACC 78 0 
GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 840

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960 

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 102 0 

T T AAAT AC AT GATATGTACA TTGACCCCAT GTATATAAGT TTTGGTGACT GAATACTGGT 108 0 

GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 1140

ACTTTCAATT TTTGACCCGG ATATATAAAG AAAT T AT T T A AGCCATTTAA GCTCATAATT 1200 

TTTTGATAAG T T G T AC CAT A TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 12 60 

GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 132 0 

CTTGAGCTAG CAGTACCTGA TA^TTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 1380 

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 1440 

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 1500 

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGffiT TGGGAAAATT 15 60 

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 162 0 

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 168 0 

GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 174 0 

ACT TTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 1800 

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 18 60 

CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 1920 

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 1980 

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 204 0 

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 2100 

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCEGCAA 2160 

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 2220 

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 2280 

TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 234 0 

ATTAATAACT GCAATATCGA CAT C AGAT AA AGCACGTGCA GTTTGTTGTG CATCTACTGC 24 00 

AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 2 4 60 

ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 2520 

TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTA ACCTTTTTGA CATCTTTAAT 2580 
TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 2640

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 2700 

AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 27 60 

CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 2820 

TGATGCGACA CCAATTGTTA CTTTTTTATC ATTGTTACCA CCACACGCTG CTAATAATAC 28 80 

AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAAATCTT 2 94 0 

CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA GCGTTTGAAT 3000 

CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 3060 

TTGATAACCA T AC AC T AAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 3120 

CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GC TAT ACT A ACGAACCTAA 3180 

AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 32 4 0 

CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 3300 

AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTACCTAC 3360 

TACTAATTTT GTGAATGGCA CAATCGCAAT TAACAAAATA ATAAATGGTA GTGACCTTAA 3420 

AATATTGATT AAAGGAT T T A AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTGTTT 3480 

TCGAGTAATT ACCAATAAGA CACCTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 354 0 

AAT AG AT AC C ATATATAGCG TTTCGTACAA TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600 

AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 3660 

ATTTATTGCC GTGTCTTCAA ATTGTTGATC CAT ATT AAA T CGAAGCCACA TATAACATAC 3720 

GGTGTTACCT TGTATTTCTG AC AT AGAT G A AAATAAAATT TTAACCTCTC GCCACAAAT 3780 

TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 384 0 

ATCTTTAAAA TCGCCAACTT GTTCGTCATT C AAT C G AC G A ATCAATGATG TACTTGGCTC 3900 

AGTCTGTATA AC T G TAG AC A CAAAATTTTG AGCAATCGTC GTTTTAGGAT G AC T AAAC AC 3 9 60 

CTCTTTAAcA GTTCCTGTTT CAACCACTTT CCCCTTTTCC AT T ACAGCAA CACGATTACA 4 020 

AATGTCTTTA ATAACGCGCA TTTCATGTGT AAT CAT CAT A ATTGTAATGC CAAAGGTTTG 4 080 

ATT G AC AT T C TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 414 0 

TGCTTCATCG CATAGGAGTA TTTTCGGATT AG'AACAAGC GCTCTTGCAA TAGCCACCCT 4 2 00 

TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGGTCT TTTTTATCAC TCAATCCTAC 4260 

AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4 32 0 

GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGA& 4 38 0 

TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 4 440 
ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4500

CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCGATTGAC 4 5 60 

CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4 620 

TaCCTGTCGA AACTmAaTCA TCATCGCCCC TCCTACAAAT TCTTATTAAT CCACTAGGAA 4 68 0 

TAAGAGCATT ATATGTAAAA TTGCATATAT CGTcAATACA ATTTGCCGAA TTTTCTAAAA 4 74 0 

AATTAAAAAA TAAGTAATTC ATGTGACAAT GACGAATTGT GAGACTACTA TGACATTTAT 4 800 

CAAATTAAAT CCATAAAAAT GTCCACCAAT CCTCCACAAC GCAATTACTA AATATTAACA 4 8 60 

TCGCACAAAA AAGCACTAGC ATATTCAAGA ACAACAAACG TTGAACTCAA AATATATGCC 4 920 

AGTGCTGCTA TTATTTATAA AGTATCTAGT GCTTGTTTTA AATCATCGAC TAAATCTTCA 4980

GTATCTTCAA TACCTACAGA AATTCTTACA AGTCCGTCTG TAATACCTTC TTTAGCTCGA 504 0 

ATATCTGCTG GAATGGATGC ATGTGTCATC AATGCAGGTA CTGAAATTAA ACTTTCCACT 5100 

GCACCTAAAC TTTCAGCTAA TGTGTAATAC GATGTTGCTT TAATCAATTG TTTGGCACTT 5160 

TCTGTATTTT TCACTTCAAA TGCAATCACA CCTGTATGGC CATCCGCTTG AGCCATATGG 5220 

ACATCATGAT TTAAATGACT TTCAATACTT GGATGGAACA CTTGTTGCAC AGCTGGATGT 528 0 

GCTTGTAACA TTTTAATAAT TTCAATAACG CTGCGATTAA TTTGTTCCAT ACGTAAACCT 534 0 

AATGTTTTAA TACCCCTCAC AAGTAAATAG CTATCTTGAG GTCCTAAAT GCCACCTGTT 5 4 00 

GAATTTGAAA TAAATGCTAA ACGTTCTGCA AGCTTGTCAT CCGATGTTGC AACTAAACCA 54 60 

GCAACGACAT CACTATGTCC ACCTAAATAT TTCGTTGCAG AATGTAAGAC AATATCGATA 5520 

CCTAAATCTA ATGGATTCTG ATAATAAGGT GTCATAAATG TGTTATCAAC AACTGAAATC 558 0 

AAACCGTGTT CTTTCGCAAT TTCAGCAGAC TTTTTAATGT CAGTAACACG TAATAATGGA 5 64 0 

TTAGAAGGTG TTTCAATAAA CAACATCTTT GTTGTTGGGC GTATCGCTTG TACAATTGAA 57 00 

TCTGTATGCG TTGTATCTAC AAAATCCACT TCAATGCCAA ATCGTGTAAA TACTTTTGTC 57 60 

AATGCGCGAT AAGTACCGCC GTATACATCT GAATTTAAAA TAATATGATC TCCTTTGTCC 582 0 

AACAGCATAA CAACTGCACT GATTGCTGCA ACACCTGAAC TAAATGCAAA GCCATGTTTG 5880 

CCATTTTCTA ATGTCGCAAT AACGCTTTCT ACAGAACTTC TTGTTGGATT CGCAGTACGA 5940 

GAATATTCAT ATCCTTGACG TAAATCACCA ATATCATCTT GTAAATATGT ACTTETTGA 6000 

TAAATTGGTG TTGTAACGGC ACCTGTATAA TCGTCTGTTG TGTGCCCACC ATGAATTAAT 60 60 

TTAGTTTTCT TGTTCATTAT TATTCTCCTC ATAATTAAAT ATTTGCTTAG ACATATATCG 6120 

AT C AC T AC C A TCTGGAAATA CGACAACAAT CGTACCTTCA GATAATTGCG CTTTTAAATT 6180 

CAATGCACCT TGTAATGCTG CACCTGaAGA ACTGCCTACT AACAACCCTT CATTTATAGC 62 4 0 



CAAACTTTTG ACATTTCGAA AGGCATCTTG ATCTTTAATC GTAAATATCC CATCTACAAG 6300 

ACGTCTCTCT AAAAATATCG GCCATTTCTC AGAACCGATA CCTTCAGTGT CATGTGCATG 6360 

AGCTGGCCCT CCATTTAACA CGGACCCTTC TGGCTC&CG GCATAACATT GCACGTGATG 64 2 0 

TTGCTTTAAA TAACGTGCGG TACCTGTAAA TGTACCGCCA GAGCCAATAC CAGCCACAAA 64 80 

ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 65 4 0 

CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAGCATAGGA 6600 

ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6 6 60 

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 67 2 0 

AC AT T T C AAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 67 8 0 

CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 68 4 0 

ATATTTCCCG AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA GCTTGGCATA 6900 

AATTTTAACT T TAT CATC AC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 69 60 

ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAATGAA TGAAACCTTT 7 02 0 

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7 080 

TTTTGCAATT ATCTAAGTTT CGATTAATTC AG AAC C AG T A CTAAATTTTC AATTCCAAAC 7140 

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 72 00 

TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 72 60 

CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7 320 

TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 7 38 0 

AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 74 4 0 

AC C AAAAG AT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 7 5 00 

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 7 5 60 

AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA ATAGAATAA 7 62 0 

TGTGCCTAGA TGCATTTGAC T AAAG AC CAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 7 680 

TTCTTGTGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 77 4 0 

AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 7 8 00 

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 7 8 60 

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 7 92 0 

AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA T G AC GAT T AA 7 980 

AAAGACAAAC AGCAATGGCA TCATAACTTT CGffGCCTTT TCTAATCCTT TTTCAACACC 8 04 0 

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100 
AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCGA T AAC ACT A3 1 8220

GAAGCCAAAT AAAATAAACA CTGCCAAATT ACCGTTCCAG CCAATGATAT TGAGCCATTT 8280

TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 8340

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 8400

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520

TCTTTTCATC ATTCAC 8536
(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

GCTTTGGCCA TTTTATGTGG C GAT T GAG AC AATCtdkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CAT T AAC AAA 12 0 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA T CAT C AAC AA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 24 0 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AAT AT C C AAA 300 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 3 60 

ATCAAAATTC CACGTTTCAT C CAT T GAAG A CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AG T GAT G AC A TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 4 80 

GCTAAGAAAT C CAT GC AGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 54 0 

TTTATAATGT G T CAT GAAT C ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTCAGAT AACAATGTGG 6 60 

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 780

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 84 0 

TCATCTGAAT ATT ATT T AAT TTTGACATTA ATAGTCCATT CAT AT CAT GA T AAAAT GAT A 90 0 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 
TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 108 0 

TTTTCTTAAT CAATTCACCA ACGAATGCAC TCTCATCTGT AGCTTGTTCC AAATAACATT 114 0 

TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 1200 

TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATCTCATATGAAA 12 60 

TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 132 0 

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 1380 

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 14 4 0 

CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 1500 

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 15 60 

TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 1620 

AGTGCGGTAA AATCTTCATC TCAGAAGTTA A(AATTCTGT TGCATTCAAA TCAGGTGTTA 168 0 

ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 17 4 0 

GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 1800 

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGET 18 60 

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 192 0 

TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 198 0 

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 204 0 

CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100 

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 2160 

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 2220 

TCGACTCCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 2280 

TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 234 0 

ATAAGCATCG AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 24 00 

ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 2460

AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 2520 

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 2580 

TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATAT CATGT 2 64 0 

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 2700 

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 27 60 

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2820 
ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTffiGA ATATGCTTGC 2880

TTTAGAATAT ATGGAATACT GATATCTTTA TCGAACTTTA TTAACCGTCT ACGTGCTAGT 2940

ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3000 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3060 

GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT C AAC AG T AAA ACTTTCCCTT 3120 

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 3180 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 32 4 0 

AACCCTCCAA CAACTCGAAG ATGATGT1"E TCAGATTGCG GAT G TAT AAA TGTGATGTTA 3300 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 3360 

AATTGAATTG CTTTGTAGCC AAAAT AT G AC GTTCGATTAT CATTCAAACG CATATAAAAG 34 2 0 

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGATCTGAT 34 8 0 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAAT AT CG 354 0 

TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600 

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3660 

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3720 

TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 37 8 0 

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGCTT CCGTAG TACC CGTTTTAGTT 384 0 

GGTACAGCAG CAT C T AAAAT ATATTTTCTT GGTAGGAAG ATCATATTAA AATCAGCCTA 3900 

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3960 

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4 020 

GTTCGCAACG GTATGAGCCC AGAGCTTGTG C T AC GAT TAG CCATTGCATT TTACATCTTA 4 08 0 

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 414 0 

TTAGTATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4 2 00 

CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 4 2 60 

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4 32 0 

ACTATCGG 4328
(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280:

GTTCAATACA GAAAAAATAA ATTTAGATGT TGAAGCATCC TACAATTAAT ACAGATCCAT 60 

TTCAATATAT T T AAAC T AAA ATCTCGGGAT TTCTAAATT TGAAATTTCG AGGTTTTnAT 120 

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180 

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 24 0 

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300 

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 3 60 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 4 20 

TGCACCAACA AG AG AAC T T G CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 4 80 

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 540 

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600 

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660 

TATGATTGAC T TAG GAT T AA TTGAAGATGT AGATTACATT GCTGCAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG T T ACAACCAT TTTTAAATAA 780 

ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 840 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

T GAT AT ACT A AATCCATACT TAT G TAT TAT TTTCTGTAAT AG TAG AG AT A ATGCAAATGA 960 

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCTTAAC 102 0 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 108 0 

TGCCaGCGAT TTAGCATCTC GTGGTATUA TATTGAAGGT GTTAGTCrTG TCATCaATTT 114 0 

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200 

TTATrrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 12 60 

AGAAGATCGC GGTTTTGTAT TCAATACTGT T GAT ATT AAA GATGGTGAGT TAAAGAAGT 132 0 

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT G AC CAT T T AA CTAATCAAGT 1380 

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 14 4 0 

TAAACAAGAA 1450
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281:

AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 60 

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 12 0 

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AG AT AT GGAT GACACATTGC 180 

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 24 0 

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 300 

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 3 60 

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTGATG 4 20 

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 4 80 

TTATTTaCGA CAGCGAACAT GAG TAT AT G A ATATTGAAGC AGAATTAACA GGTTTACCGA 54 0 

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA TGTACCCAAG GTCATGGGTG 600 

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA 660 

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720 

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 780 

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAST CGCAGGTCTA GCTATTGCTA 84 0 

TGGGGaATGC AT C AG AT G AA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 900 

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960 

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 102 0 

GT TAT AC TAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 108 0 

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2931 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 

ATACTTTTGC CTTCGAGTGA ATTGGCGAGA CCATTGTTAT TAGCAGCGTT ATCTAAAGAT 240

AATGAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTAACAAACAAAA TATACAAGAT 300 

GTTAAAGAAA T GAT AGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 3 60 

GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 4 20 

GAACAAACAG CACGGACCAT TAAATCATAT CAACAACCAG TAACAATTGC AGAAATTCAA 480

ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 54 0 

GACATAGAAG ATTGAGATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AATCATGTAA 600 

GAAAAGAAGA TTTAATATAT CCAATTTTTG TAG T T G AAAA AGACGATGTG AAAAAAGAAA 660 

TTAAGTCATT GCCAGGTGTA TA:CAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 720 

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 7 80 

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 840 

AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTBA TATACTGATC 900 

ATGGTCATTG TGGCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960 

CACTACTTGT TAAAACAGCA ATTTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 102 0 

GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG AT TAG AT G AA GCCGGCTATT 1080 

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 114 0 

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 12 00 

CTAACCGTTT GGAAGC AC T T CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACAT G A 12 60 

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AAT CAT AC GA 1320 

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 1380 

AAAATGGTTG GAT AG AT G AA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 14 4 0 

CAGGTGCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGAAAAT 1500 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA T GAG AT AT AC 15 60 

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 162 0 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 1680 

AAAAAT T TAT GATATCGATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 174 0 

TATTTTAGGA CAT AG AG AC C CTCAAGTTAT TAGTCATTTA CAT G AAGC AA TTGATAAAGG 18 00 

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 18 60 

AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCXSGT ACAGAAGCTA CATTGGATAC 1920 

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 198 0 

TCATGGTCAT AGTGATTCGT TATTAATCAA AGCTGGTTCT GGGGTGGCAA CATTAGGATT 204 0 

GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100

T GAT T TAG AT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 2160 

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220 

ATTAAGAGAT ATTACGACTG AATACGGCGC ATTGCTAATT TTCGATGAAG TAATGACTGG 228 0 

TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2 34 0 

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2 4 00 

CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTTAT CAGGAAATCC 2 4 60 

TCTTGCAATG ACAAGTGGTT AT GAAACGT T AAGCCAATTA AC GC AG AG A CATATGAGTA 2 52 0 

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2 58 0 

TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA AT G AAGGAC C 2 64 0 

TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 27 0 0 

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCTTATC 27 60 

TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2 820 

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2 880 

TAAATGTATA TAAGGCATGG GCGGCG\CTT GATAGTGAAA GTCCACTACT A 2931
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60 

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120 

AT TC AAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAC ATTACATCGT 180 

TTACCTATGC ATAATAGAAA AG AC AT GAT G GTTAATGGCG GTGTACTTAT GGCACATTTA 24 0 

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 300 

ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360 

AT AT AG T C AA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420 

AAGCATTGCG GAATCACTTA ATATTTCACG C AC T GC AG T A AAAAAAGTGA TTGACCAATT 4 80 

AAAGTTAGAG GGATGTAAAA TAGATTCAGT AAATCATAAA GGGCATTTAT TACAACAGCT 54 0 

CCCAGATATT TGGTATCAAG GTATAATAGv CCAATATACA AAAAGTTCTG CTTTGTTTGA 600

TTTTAGTGAA GTATACGATT CAATAGATTC T AC AC AAC T T GCTGCGAAAA AGTCACTTGT 660 

TGGAAAT CAA TCTTCATTTT TTATCTTGAG T GAT G AAC AA ACGAAAGGTC GTGGGCGATT 72 0 

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTXAGACC 780 

TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 84 0 

TGCGATTCAA CATTTTAGTC AAGAT GAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 900 

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960 

AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020 

TAT T AGAC AT AGAGCAACAA GTATACAATT ACATGATAAA AAT AAAT TAG AT AGAT AT C A 108 0 

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 114 0 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCAAT ATTTGGAATA GAACGTTGCt 12 00 

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 12 60 

TCTAATTGTT AG AG AT G AAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGATTT 132 0 

T T AAC AC T AA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 1380 

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 1421 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT G C n AAC GAT T GAACAAGAAT 60 

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 120 

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA T AAAT G AAC G 180 

ATAAACAATG GTTATCTATC TGCACTAATA AGG T AG AT AA TCATTGTTTT TTCACGAAAA 24 0 

AAT T T AC AG A GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG AC AC T T AAAA 300 

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGA AATGTATGAC ATTAAAAAAT 360 

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 4 20 

TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 4 80 

ATGTCATTCA TTTAATGAGC AGAGTAAGAA GATACCCATT GCCTTTAGTA CTTGAAATAT 54 0 

CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 60 0 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AG AT T T AG AA GCATATGCCC 7 80 

AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 84 0 

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 900 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATGCA GATACAATTA 960 

TCGTCGGTGA TATTATTTAT AAAG AT AT T A AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020 

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 114 0 

AACACATAGA ATTGCTTATT TAT TAG AC G A AAAAGATGTC T C AC CAT AC A ATGTTTTGGC 12 00 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 12 60 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320 

TCGTGATGCA GATCGAATTG GTATAGAA;G CAATTTTACG ATAATTGATC CTACAGACCA 1380 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 14 4 0 

TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 1500 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTigATATCA 1560 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGAT T AA 1620 

CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 168 0 

TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG 17 4 0 

TAAGTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 18 00 

TGCTGATATT CAAAATATCT TATCATTTGA AAAAG AC TAT CCAGAAGCGA ATACAATCTT 18 60 

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGA1TGCA AATACGAATG GTGAGAAAAT 198 0 

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAAT AT C AA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

CAATCACGTG TACTTGAGGA AAC AT T CAT G AAATCTAATA TGCCATACAC AATGGTTGGT 2160 

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG cAAGATGATG GCACGATAAA 60 

ACCTAAACAG AAGAAmATAA ATAGTAATAC GAT GAT AAA T AATGGTCCAC TCATATGTTG 12 0 

AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 180 

ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 24 0 

GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 300 

CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCAA ACTGGGAACC ACCAACCTTG 3 60 

CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 4 20 

C T T AC GC AAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 4 80 

AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 54 0 

CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 60 0 

CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 6 60 

AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA 720 

TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 780 

TCTTC 785 
(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 60 

AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAACTATG TACAAGTAAT 120 

TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 180 

ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 24 0 

GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 300 

TCAACGGCGA GAGAAAGTTC ATTTAACATT GATTCTAATA TTGATGCATC ATATGGATTA 3 60 

ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 4 20 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 4 80 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA T T TAT TAG T A AGTTTTCAAC TGTATAAGTT 660 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTGCTTTTTAGTA 7 20 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 
(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 12 0 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC T G AAAT a AC A 18 0 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 24 0 

GAT T T T AC AG TGTTAGATAA TGACTTIAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300 

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 3 60 

AAATTCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 4 20 

CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TfiATTAAGT 4 80 

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA 54 0 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA AC AG CAT T AA 7 20 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 7 80 

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 84 0 

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATGAAAA TCTTGGGCTA GCAATGGAAC 90 0 

AAGTATATAC CAATGAAAGA GGATTGTTAG AACAATCGAC GTTACAAGAT CGACGTAAAG 960 

CATTCCAATT TGCATATTTA AGTTTAATGC a G GAAG AAAA GATAcAAGCA AATCATCAAA 1020 

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 108 0 

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 1140

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200 

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 12 60 

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 1320 

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 138 0 

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 14 4 0 

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA ETAAAACAG CTTGAAAAAT 1500 

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 15 60 

AT G AAAAAG C GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 162 0 

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 1680

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732 
(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

AAAAG AAC T A GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 60 

AAAAATAAAT AG TAT T C G AA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 120 

GAATCAAATT AACGAAATT5 TGCTTGAAAC AAT TAG AG AT ATTAATAATG CGCATACATT 18 0 

ACAGCAAGTT GAGGCTGCAT T G AAC AAT GG TATTGCTCGA ATTTCAGCAG TACAAATTGT 24 0 

AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 300 

AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACT&TGGAC AT AAAAAG AA 3 60 

ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 4 20 

TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 4 80 

GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG G AAG AAT TAG AAATAAGAGA 54 0 

TAATAATAAA GATTCAATAA AAG AG AC T T T AGACGATACA AAAC AT T T AC CACTTTTATT 600 

TGCGAAACGT CGCAGAAAAG AAGATGAAGA AGATGTTACT G T T G AAG AAA AAGATTCGCT 660 

AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 7 20 

TCGTAAAGAA GAT G AAGAAG ATGTGGAAGT TACAAATGAA AAC AC AG AT G AAAAAGTGTT 78 0 

GAAAGATAAC GAACAT TC AC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 84 0 

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAATACTTC 960 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 114 0 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 12 60 

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 132 0 

G C AT AAA T AC ATGCAGCCCA TTCTGCAGTG TGCTTATCCA TTTCGCCTAA CTCATGTGTA 1380 

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 14 4 0 

cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500 

TTAGTTAAAG TT ACT ATT AC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACAG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 168 0 

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 17 4 0 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 18 00 

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC AC AGAAG AT G ACGAAAAGTC AGCTTACAAT 18 60 

AATGTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT T T C T AC AG AC 192 0 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAAAATATTATTT CTGTCCCACT 1980 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 204 0 

TCTTACCTTT ACGATAAAAT GATTTAGCAA T AT AT CCAAA TGGTACATTG AAAACTGTTG 2100 

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 228 0 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 234 0 

CGTGGCACAA CATCAAAGAT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 24 00 

ATAAAATGTA ATGACAATTG CATAACAATA AT C AT AAT T A ATGTTGATGA AAAGCCTAAC 24 60 

CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 2520 

TAAAT AGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAABT TTCTACAGTT 2 58 0 

TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 2 64 0 

TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 2700

CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 2760

GAAACCGCTC AATAAATAA 2779
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA 60 

TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC G AC AT C T C T A 12 0 

TGTTTTCCTG TCAACAATCC AAAAX) AC T A TCATTTTTTA TAGGTAATAC TTTTTTAAAT 180 

TCTTCCTTAG CTCTATTTAA AATCCCCTCA T T AT GAACAA TAAATAAAAA TTTATTAGGG 24 0 

TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 300 

ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAATGCTTCTGCT 3 60 

TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA 4 20 

GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 4 80 

TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA AC AAG T C AAA TTCACTTTTT 54 0 

ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 600 

ACCTTTAACG CATTAGATGT T AAAT TAG AG CTTCCTATTA CCATAGAACT ATAATCCTTA 660 

TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 720 

TTTTTTAATT TAAGTAATTC TCCATACATT T'AGGACTAT TAAACCCTAA G T AAT TAG AT 780 

GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 84 0 

GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 900 

TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTEA. 960 

TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA 1020 

TCATTTAGTA ATCTACTCAT ATTATCCCTC AGTCATAATT TTATTAACGG CTGGTATATC 108 0 

CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 114 0 

TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 12 00 

TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 12 60 

CATTTCTTCT CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320 

ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380

AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 14 4 0 

CACCCTTCAA TTTAAAATCA TCTTAATTGT TAT T C TAT C A AAAATTACAA AACTATATAT 150 0 

AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560

ATACACATCT AT T AT AT AAT ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 162 0 

TTTGGGGCGT ATTTCTTTAT AAT TC ATT AC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 168 0 

TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AAT GG AT AAA GAATTATGGA 17 4 0 

TAGAACGAGc T AAT GAT AG T TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AGCGAGAAGG TTTTGAAAGT AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAAT TCG 18 60 

GTCTTGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 192 0 

GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTACBAT AT TAG AC AT C 1980 

TTTCcAACTG AATTCGCCC 1999 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA AT GAT T CAT A TGGTTCATAA AAATCACGCG TCGTAATATA 12 0 

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 18 0 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 24 0 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GC AAT AT T AA GTCTATTTTT 300 

CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT TTTTCGCAAT ATCTTGCTAA 3 60 

ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC TACCTCTAAA 42 0 

TGTTGGTGCG AAAAG AAT G A CTTTCTTACC TTTAATAATT GGTAATTCAT CTTCCATCTC 480 

TTGTTTGATC TGTGTCGCAT AAGCTTCATC AAATAGTACA TCAGTACGT GG AAC AC C T G T 54 0 

AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA TATCGGTTTC 600 

AGATGATACA TAAGCTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC 660 

CTTTTTACCA G T AC G AC T AA AGCCAACTGT TTTAAAGGCA CCAACGGCAT GCCATACTTG 72 0 



AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA AGTCATCAAC 7 80 

AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA TGATGCCACG 840 

TCTATCTGTA ATATTCGCTT T AAAAAC AG T GTGAATATCA TACTTTTTAT CTAAATTTTG 900 

ACGTAACATT TCGTTATAGA TGTATTCAAA G'TTCCAGAC ATCGTTGGTC TAGAGTCTGA 9 60 

TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT TAAATATCGC 1020 

TTTAAAAATA AATTGTCTTG TATTAAATGA TTGTTTGCGG AAATACTTAC GTAATTCTTT 1080 

ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA CATCAAGAC 114 0 

AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT TCTTCGAAAT 1200 

GCCGCCTTTT CTAAATTCTT TTAGGTAAGT CTGCAATAAG AAATTGATTT TACCATTTTG 12 60 

TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT ATGCATCATT 1320 

AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 138 0 

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 14 4 0 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACA TGTCCTTCAA CGAATAACTG 15 60 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 162 0 

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 168 0 

TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 17 4 0 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800 

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 18 60 

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 192 0 

GTAATCGATC ATT 1933 
(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

nGTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAAT T CAT C A GATGTCAAGG 60 

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 12 0 

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTCTAGCA AAGCTGATGC 180 

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 24 0 

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 4 20 

GAAAAAGT T A TACGCAACAC C T AC AC AAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 4 80 

AG T AT AC AAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 54 0 

TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT AT T T GAT C AG 600 

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660 

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720 

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTT ATAAGGAATA 7 80 

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 84 0 

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC T T AC AAC AAA AAAT AC TAT C 900 

ACATTTTGCA AC AT AT G AC A ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960 

TAAAACACAT GCAAAATCCA CACTTTCAAA AC T T AAG AAA AAAGGAAAGA TTGCAATGGA 102 0 

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080 

AAAAGAAATT T TAG AAG AAC AATTTGAATT GTTAGCAAGA T T AAAT G AAC AAAGTGATGA 1140 

C C C T AG AG AA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200 

AGGAGTTAAT CAATGGCAGT AT T AGAAGG T ATTTTTGAAG AATTAAAACT AT T AAAT AAG 12 60 

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAG&GTT 1380 

AAGGAAACTT CTGCTGATTT G AC T AAAG AT TATGTTTTAT C AG TAG G AAA AGAGTTCCTT 14 4 0 

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AAC T T AAC G A ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TAT GAATGCG 1560 

AGAATAAATG CATGAAGCTA GAT C AC T C AA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620 

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680 

AAAGTTCAGT TTTTGCTGAA GAAGGTACAT TCGCTCATGA GTTAAGTGAG TTATATTTCA 17 4 0 

GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAA'AA AGCTTTTCAA AATTATAAGC 1800 

G AAA T C AAT A TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 18 60 

AAGAAAAG T A TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 1920 

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 1980 

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 204 0 

TAGATAATC 2049
(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 60 

TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 180

GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 240

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG SGTTCACCC ACATTAACAT 300

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 360

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 420

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 480

CCTCATTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 540

GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 600

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660

CAATTTCGAT AACTTTTTTA ATA3CTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 720

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 780

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 840

ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGG TTAAGTTTTC 900

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 942
(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60 

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120 
CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180

GATCTGGATT ATACGTTTGA AT GAT G AC T T GACCTGCCTT T T CAT GAC G A CCAGCTCTAC 24 0 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 3 00 

ACATTGTATC TGCATTCAGC ACACCZ&CTA AAGTAATATT TGGATAATCT AATCCTTTCG 360 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 4 20 

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 4 80 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT (ETCGAATGT 54 0 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT AT C AC AG T T T GGACATTGCG 6 60 

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 7 20 

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 7 80 

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 840 

AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900 

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCTAAAAT GACTGGACAG T GAT GAT AT T 9 60 

CAcTTCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 1020 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 10 80 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCA 1140 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 12 00 

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 12 60 

TTCCTTGG 1268 
(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

T AC C AC C AAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 60 

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120 

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 180 

AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGCTGC T GAC AT GAC A GCGATACGTT 24 0 
TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 300

CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 360

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 420

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 480

GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 540

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGakGT 600

TkGCTAAACG CGTTGtCGIC GATGATTAA 629



(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60 

AGATAATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 12 0 

TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 18 0 

AAT TAT AG AT GCATTTGGTA TGAGACCTTA TAT T AAAAAG AAAGTTAAGA AATATTCAAT 24 0 

GGGGATGAAG CAAAAGT T AG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 30 0 

CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360 

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 4 20 

TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 4 80 

AAACATGGAG GAAGGTGTTG CATCTGACAC AAC GAT AG T T ACTGTTGATC AT AAAG AC T T 54 0 

TGATAGAACT G AAAAAT AT C TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 600 

CGGACATTTA ATGATCAATG CACAAAAAAA TTATCAAGTT AT AC T AAAAG CATTATCTGA 660 

ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 72 0 

CAATATAAAT CAAAGAGGTG ATAAATAATG AG AAT T T T AA ATTTAGTAA GTATGATTTT 780 

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 84 0 

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC C G AAAC AT AT TATTGTCTAT 900 

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 960 

AAGAC AAT T A CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG T AAAG T T AAA 102 0 

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 1080 
GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140

GCGTTTTCTA AAT AT G TAG A TAATTTATG5 TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 12 60 

GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320 

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGASAAATG 13 8 0 

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 14 40 

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 15 00 

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 162 0 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 1680 

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 17 4 0 

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 18 60 

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

GCATCTCCAC ACCAGTCTTC AG T AAT TACT AATACTTTAG AATAATTCAT CTCTTTTATT 198 0 

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 2 04 0 

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160 

AGTAATAAAA GGTAATG£AA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 234 0 

TTGGTTTATA TCAATATCGT CAT AC G AAAA CTGTGGATTT GTCDAATCTT GAAATAAACG 2 4 00 

ATATTAAATT AAATGAGAAG TTTGATAAAA AAGGCTATGA AGTAAATAAA AAAATTAAGT 2 4 60 

TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGT GAGAG 2520 

AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2 580 

TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 2 64 0 

AAAG r AC a AA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2700 

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2 7 60 

AGCTTTAGGT CTTAAAGTTw TAAAi&ACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2817
(2) INFORMATION FOR SEQ ID NO: 296: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 6 0 

TTATGGTCTC GATATTGGTA G AAT AT C AAA TGGTTAAATG AG AAAAAC T T GGAGGTGCTC 12 0 

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGSAAA GGAAAAATAA 180 

CGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 24 0 

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 30 0 

TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA GTTGCACTAT 3 60 

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 4 20 

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA T G AAT AT G AA GAACGTTTAC 4 80 

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG 54 0 

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600 

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATGTC 6 60 

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720 

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GT&TTTTCG 780 

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 840 

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900 

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 9 60 

AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 1020 

CAGGACGTCC ACGtCGTGTA GGTTGGTTTG ATTCAGTTGT ATTACGTCAC TCTCGTCGTG 1080 

TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC C T AGAC AC AG 114 0 

TGAAAATCTG TACAGCTTAT GAAT TAG AC G GTAASAAAT TACTGAGTAC CCAGCAAACT 12 00 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 12 60 

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380 

CAAACCTATT AAAAG AAT T G TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 14 4 0 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT AC T AAC G T AA AG GC GAT AT T TGCTAGTCAT AATTTAGAAG 1560 
rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607
(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60 

GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120 

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGCTG AATAAAAAAG AT AT AT T AAA 180 

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 24 0 

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA AT T AGCAAAT TTTCAACACT 300 

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 360 

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAAT T T CAT T 420 

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 4 80 

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 54 0 

CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600 

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660 

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 7 20 

TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATAATG TGGAAAAATA 780 

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 8 40 

AAATCGTTCT ACTTGCATAG ATTAAGAAAA AAGTTAAAGC AAAAAC AG C T GCAGCTAACA 900 

TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA ATGGCAATTA 9 60 

AAATTGCACT AATGACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA ACGCCCATAT 1020 

TTTTCATCCT AACTTAATTA AT C TAG AT C A AAGTAAGTAA TGAAACAATC ACAGCTAACA 108 0 

C G AAAAAG AT CACTAAATAA T T TAG T GAAT ATATAAACAT TTGTGTTGCC CATTTTGTTT 114 0 

GATCTGAATT TTTCTTAAAT GTTGTTi&AC C T AAT G C AAT CCATCCTAAA TTTAATAAGG 12 00 

TrGCTAACAC TaCGAATACG AC AC C T AAAT TTATTAGTAA TAAAGGT AC T GGCAATAAAA 12 60 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

TTGGAATATT TGCAAGTGCA TATTCATCTT T AC G T T T AAT AGCTAAGGcA TAAAATGAA 138 0 
TTGGtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTCCTT 1440

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGTGT 1500 

TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 15 60 

T AAG AC CAAG TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 1620 

TCATACCAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 168 0 

TATTTTGTTT ACTAGGCATA AT AC GAT C AA TATCTTGGTC GTAATAATTA TTTAACGCAC 174 0 

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAEATTAA TAAAATTTGA GGTATTGATG 1800 

AT AAG AAG GA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 18 60 

AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 192 0 

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 1980 

T AAG AT TAT G ATATCTTAGA TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 2 04 0 

AAATTAACGA CAAGAGAAAA TGTCAATTTT G T GAC AC AAA TAACATTTAA TTTATTGCTA 2100 

TAATGTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC AT C T AAAAGA GATTAATATT 2160 

TAAGCTTCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 2220 

ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 2280 

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 2 34 0 

GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA EGGGTGTAT AAATTGTTTG 2 4 00 

GCAAAAAGAA TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA TTTGTACAAC 2 4 60 

TTGGTGGAGC CTTAGTTACC AAAACCGGAT CAGCTGATGG TTGTGGTTCT TCTTGGCCAC 2520 

TATGTCATGG TGCGTTGATT CCAGAATTCT TTCCTATTGA TACGATTATT GAGTTAAGTC 2580

ATAGAGCCGT TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 2 64 0 

AACATATAGG C TAT ATT AAA GAAATTAAAC CTTTATCAAT CATTAGTGTT GGATTCTTAT 2700 

TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2 7 60 

cATTGCACTT TGGTATATCA TT.SATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

TTTTCTCTAT AG AT C AAAAA TATGAAGCTG AC G AAT TATA TAT C AAAAAG C CAT T AAG AC 2 880 

GTTTAACATG GTTAATGGCA AT CAT C AT T T ATTGTGGTGT TTATACTGGT GCACTAGTGa 2 94 0 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACE CGATCTTGTA 3000 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055
(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 

TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60

AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 120

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180

ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATGATAAG 240

TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300

TCTACACTTT TTAATTTTAT AATAGCGDAA GACTAAACAG ATTGAAATTA GTAACAATAA 360

AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420

AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480

ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TSAGTTGAA 540

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660

TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720

GAATGGTCAA TACTCAATTA TGGAnTTC 748
(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA At T T AAACAG GCGAAGTCTT 60 

CAATAAGTGA AGATGTCCAA AT T AT AAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 120 

TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 180 

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 24 0 

TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 300 

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATffCGAC 360 

AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AAT T T AC C AG TAGTAGTGAT 420 

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGTTTCA ATTAATTACG TTTCAGGATC 4 80 
TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 540

TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 600 

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 6 60 

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 72 0 

TCaAGAGTTT AACGTAGAAC CTGGCAACAG TTTATCAAG TTTTCATAAA AGGAGTTTTA 7 80 

G T AT T AT G AA AATCATTAAC ACAACAAGAT TACCGGAAGC AC T T GG AC C A TATTCGCATG 84 0 

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 90 0 

ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 9 60 

TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCGAA AGCGACCATT TTCATTAAAG 102 0 

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT GAACACAAGC 108 0 

CAGCGCGTAG TTGTGTAGAG GTTGCGCGTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 114 0 

TAGTAAGTAA AATTAAGG/A TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 1200 

AAAACAGAGA GAT AT AT ACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 12 60 

TAGAAAAATA C AAAC AG AT G GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 132 0 

TTTCGTAATT CATGATTTAC GTGTAATTGA AGG AAAC T C T GGCTGTTCG TTGCAATGCC 1380 

AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 144 0 

GAGACAAGAA ATTCAAGATG C AG T G AT G AA AGTATATGAT G AAAC AG AT G AAGTAGTACC 1500 

AGATAAAAAC GCTACATCAG AAG AT T C AG A AGAAGCTTAA TCAATTTTAT ATTTAGCGAT 15 60 

GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 162 0 

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 168 0 

TTAACAGATT AT GAT GAT AT GAAAAT T AT T GATTCATTGC TGATAAAAAT TGCGTTTGAA 174 0 

TGaTGCTCGT ATTTTTGAAG TAAGAffiAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 1800 

CAATGCCTTT TATATGTTGA AAG AG T AT T G CaGATTAAAT TaTAATAATG ACGAaGgTAA 18 60 

AATTTAATGG GGGTTAATGT TCATGCGAAG ACACGCGATA ATTTTGGCAG CAGGTAAAGG 192 0 

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA ACCTATGGT 1980 

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 2 04 0 

ACATGGTGCT GAAAGTGTAA AAG G AC AT T T AGGCGAGCGT TCTTTATACA GTTTTCAAGA 210 0 

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2 22 0 

TTGATTGCGC AT C AC G AGG A TGCTAATGCT CAAGCAACTG TAT TATCTGC ATCGATTCAA 228 0 

CAACCATATG GATACGGAAG AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 234 0 
GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GA'ATTAATG AAATTAGTTC AGGTATTTTT 2400

GCGTTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 24 60 

GGTGAATATT ACCTCCCTGa TGTATTGTCG TTAATTTTAA ATGATGGCGG CAT C G T AG AA 252 0 

GTCTATCGTA CCAATGATGT TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAG 2 58 0 

CAGGCTGAGA AGGCGATGCA ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 2 64 0 

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 2700 

ATTGAACCAG GCGTACGAAT TAATGGTCGT AC AG AAAT T G GCGAAGATGT TGTTATTGGT 27 60 

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2 82 0 

GTTAATGATG CTAGCGTAGG AGCGAATACT AAGGTCGGAC CGTTTGCGCA ATTGAGACCA 288 0 

GGCGCGCAAT TAGGTGCAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAGAT 2 94 0 

CTTAAAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG GCGATGCTGT AATTGGCGAA 3000

CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA T AAAT TT AAA 30 60 

AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120 

ATTGGTGATG ATGTATTGGT GGCAGCTGGT TCCACAATCA CAGATGACGT ACCAAATGAC 3180

AGTTTAGCTG TGGCAAGAGC AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 324 0 

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300 

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360 

AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 34 20 

TTTCATTGAA AGGAAACGAA GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 34 8 0 

TAGGTAAATG TTCAGTTAAA CGTTTTAGTG AT GG AG AAAT TCAAATTAAT ATCGAAGAGA 354 0 

GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTG3G AATCTACATT 3600 

TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 3660 

TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 37 2 0 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGctGG CGCAACTCGT ATGATTGCGT 37 8 0 

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 38 4 0 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3 90 0 

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG AC AT T T T AAA 3 9 60 

AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4020 

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 4 08 0 

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 414 0 
TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CGTATCGAAA ATTOTCTAT 4200

AAAAGAAT T A ATCGTAACAA ACT C AAT T C A TTTAGATGAA GATCGCAAAC CATCTAACAC 4 2 60 

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT AC G AAAG AG A 4 32 0 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 4 380 

TGTATAATAG TTTCGTTCGT GAT TAT AC G A ATAAATAAAC ACTTGCAAGC AACGATGATG 4 4 40 

TTGATGGGTA AGTGAGGTGC TCGTTTTGAG CAAAAATGAA AGGTGGAAAT GAGAATGGCT 4 500 

T CAT T AAAG T CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4 5 60 

AAATCTGGTA AAGTAC CAGC AGTAGTATAC GGTTACGTA CTAAAAACGT GTCAGTTAAA 4 62 0 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACGG TGTTATCGAA 4 680 

TTAGGCGTTG GTTCTAAAAC TATCAAAGTT ATGGTTGC 4718
(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGA 60 

TATTAAGCCT TTGCCAGAAA AT C AG AT AG A T GAT AC C TAT AAACGAT TAC GTCTCCAAGT 120 

ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 

AGCGATGCCG GCATTGCAAG AGCAAGGTTT TACAAAAGCG GAACTAGGTT TTGCACTTTC 24 0 

TGCTGTTTCC ATCGCATATG GAT T TAG T AA GTTCTTTATG GGTACTGTAA gTGATCGGAG 300 

CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 3 60 

GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATTCTTAAA 4 20 

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480 

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 54 0 

TATTATGGCA CCTATTGCTG CTTGGGGTAT TACAACAACT GCATTTATCA ACTTTGGTTA 600 

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 6 60 

AATTTCATAC GTATTGATTA GAG AC AC AC C TCAATCTCAA GGTTTACCTC CAATCGAAAT 720 

T T AT AAAAAT GACTTTGCTA CAAGCGATAA G AAAAC AT T A GAAACAGAAT T AAC TAC AAA 7 80 

AGAAATTTTA TTTAAATATG TAC T G AAC AA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 8 40 

TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900 
AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 960

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020 

ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT C AC AG T AT T T GTATTASTT ATTGGTTAAA 1080 

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140 

TGGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 12 00 

CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGCCAACAT 12 60 

CGTCTTAGGT GCTGTAGTTG ATAAATTCGG ATGGGATGTC GGTTTTATTT TATTAACAGC 1320 

AATTAGTGTG TTTGCAATGT TGAGCTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 1380 

CGTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATGATATA 14 40 

TTCATCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT 1500 

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 15 60 

CTAAAAGCCA TTTTCCCTTG AT AC T TAG AT GAAGCTAAGA TACCTTTTTC TACCAATAAT 162 0 

TCCCTAGGAA AT AGAAAG T A GCCATTTAAT TCATCATCGA TAACAGCAAC AATSGGTAA 168 0 

TCAGCAAACT CTTCAATTTT GTATGGCCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 174 0 

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800 

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 18 60 

TTAACTTCTG ATTCTTCACT AAAAAT AT T T TTCAACAGTA TTTTAGATTT ACACATATGC 192 0 

CAATCTCACT TTATTTTTTC ATAATCGTAT CATATATTTA TTTTTTTCGA AAAAT AC AC T 198 0 

TCCGACCGAA ATATACTCAG CCAGAAGTGG ACGAAATATT TAACCTTCTA TTAATTCAGA 204 0 

TGTATCAACT GTCAAACGAT CACGTAATAA ATAGAGATT AACATTGcTG CTAAAGCACA 2100 

AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 2160 

AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 2220 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 2280 

T GAT GC CAT T GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 234 0 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 24 00 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 2 4 60 

AAAGTTATAT GCAATTAGTG G T AC AAC AC C TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 

aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2 58 0 

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2 64 0 

AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTCGCAA GTTTAATATT 27 00 
AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 2760

ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT a AAT C AAAAC CAAAAATTAA 2 82 0 

AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 2 88 0 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2 94 0 

ACTTAAAAAC ATTACTTTCA AAT AAT TACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 3000 

TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 30 60 

TGCAATACCA CCATAAATAG AGAAJCTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 312 0 

AC C T AAT AAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AAT AAAT T AC CTAACCCCAT 3180 

T 3181 
(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGAffiTGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT T T AT AT AAAA AGGTTGCCAA 120 

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AG AAC C AC AA 18 0 

AAATCAAATC TATTGGCATC TGAACTAGTC GCAGTCAAGC AACCCAAAAT TGTTAATAAC 24 0 

TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATGATTG TAT T AT AG AA TTAAAATTGA 300 

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 360 

TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 4 20 

CTATTACCAC ACGTTTGATT AATCACAG^C AATATTTCAT CTACACGGTC AT CAT T GAC A 4 80 

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAAC C C AC CTGTTGTTGC CAATTTTGTT 54 0 

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600 

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TAOTTTTCT 660 

AAAT AT T T G A TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTGCA 720 

TTAACGCTTT TGAACCGTTG TGATTCATTA TGAATGATTT CTTGGTAACC TTCAATTACT 780 

TTTTCGTGAA cTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 84 0 

TACGTTCGCG ACCTACTTCA GCACTAACAT TT AAAT AAAT CGTCAAGTCT GGaTATAATC 900 

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 9 60 
CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020

AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTHTAGA TGCAGCAAAT AACATTGcTT 1080 

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 114 0 

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 12 00 

AC T T CAT T AA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 12 60 

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320 

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 1380 

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTCGC GCCAGAACTT TACCTTTgCC 14 4 0 

TTTTcAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500 

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 15 60 

TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 162 0 

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGATTCATA AACCAATTTT 168 0 

GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 17 4 0 

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800 

ACACGGTACT ATCATATGTT T T AT AG AAC T CGGCAGCTGA CTCTAAACTA GCCATAATCA 18 60 

AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920 

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980 

AAGATTGAAC AAC AT AG T C A GCTTGATAAT TTAATGTAGA ATCTGGAAAT CCTTGCAATC 204 0 

CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 2100 

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

GTTTGTGACC GTCATTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 2220 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 2280

CAGATTTATG TACATTTCTT GCCAT T AAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 234 0 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA AT AAGC AT C A TAATCTGAAT 24 00 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 24 60 

AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 2 52 0 

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2 580 

TTATTTAAAA TAGGTTGCTT CAT GAT AT AC GCTTCCTTTA TTTACACTGT TTTGGAATTA 2 64 0 

GTTACTTTCA AAAGTATTAA TTATATAGTA iCACTTCTTT GACAAAAGTT AGTGTTACTT 27 00 

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2 7 60 



TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2820 

GCTGTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGAHTA 288 0 

AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2 94 0 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 3000 

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 30 60 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 312 0 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT T TAG AC AT AA 318 0 

AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 32 4 0 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGA AATCGCTTGC TTCTTTCCTC 3300 

TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 33 60 

CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 34 20 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 34 8 0 

TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 354 0 

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 3600 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC G T C AG AT T C A AACGTTTTCA 3660 

CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720 

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 37 8 0 

CTCTAGCGGA ACGTAAGTTC GACTACCATC GACGCTAAGG AGCTTAACTT CTGTGTTCGG 38 4 0 

CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGATGT AATTTATACA 3900 

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4020 

GAACCTATT 4029
(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:

GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60 

ACATTTTTAA TGAAGAAAGC GATTTTCCCm C AG AG T G C AA ACGCTTGACT CAATAACGAA 120 

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180 
TGAAACAGGT GGAAAACAAA TCEAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG GTGGAGATTC 3 00 

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 3 60 

AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAAT CAAAACGTAA 420 

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 4 80 

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG T AAC AG AC G T TAT TAT GG AT 54 0 

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600 

TTGTTTGGTA GTGTTAATGC GAT T AT AGGA TTGACATCTG AGAGACCAGA TATCAATTAT 6 60 

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720 

CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 780

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 840

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTGCTA 1020

AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080 

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTtTTT 1140

ACTTTTTATA AAATGTTTTT CATTATTATT TTCATTATGT TATTAAAAAA TGGGAATTTT 1200 

AGACGTTATA CTAAATGTGC AcTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 1260

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 1320 

TAAGTATTTT AAAGCAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 1380 

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAA T G AAAAT TAT AT G T AAAAG A 1440

GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA 1500 

TGGTATTACC GCATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 1560

CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 1620 

TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680

GCATGGTAAA AATGCGGAAG ATTTAGTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 1740

TGTTGAAACA GACGAAgTGT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800

GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTGCACC 1860

TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTGAAATT 1920

ATTAGCTGAT GTAGGATTAG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 1980

CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAAGATTA AACCAAATCT 2040

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG GCAGATTTAC CAGGTTTAAT 2100 

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 2160 

AAAAGTTATT GTTCACATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 2220

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG ATAGACCTCA 2280

AATCGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2340

AGAAGAAATT GGCGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 2400

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 2460

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2520 

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2580

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC G&TTGCTCG 2640

TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATGGTGA 2700 

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2820 

TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2880

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 2940

AGTAGACGAT AAAATGCTTG ACCATCGAGA ATTTACATTA ATTTTATATG TAACTGATAT 3000 

TGTTGGTATG TTGGCACGTG TACTAGATGT TATA"CAAAG TTAGAACTAT CTGTATTAAC 3060

GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3240

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3300 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3480

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3600 

ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TCATTATTA CAAGTAGACG 3660 

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780

CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 3840



AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960

ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4020

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 4080

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140

TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATT CTTTTTAGAT 4260

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 4440

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620

GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 4740 

ATTGAGGACG TTTATGAGCC ATTTCTTATT CAGAAAGGCT TTTTAGAACG TACGCCACGT 4800 

GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGASGAA ~ 4860

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4920

TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4980

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 5040

CGATACGCGA GTAATGCCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100 

TGAAATGTTA ATGTtAACTC AAATTGAAGG TAATGATTGG GAAGTCTTAC TGAAACCAGC 5160 

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAGCTGAATG 5220

CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTA CATTATGAAG GTATTTTACA 5280

AGAAAGATTA GATGAATTAG GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 5340

TGATCCAGAT CGTTATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAGCGG CAGCACCAAC 5400

AGCAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 5460

CGCATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520

GAATGACCAC GAAATGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 5580

AAATGATACT AAGcCAAAGG ACATCGCATT ATATCAGTTG GTACAACTTC AACACGTACA 5640

CTTGAAACAA TTCGACGCGA TCATGATAAA TTTGTTGAAA CGAGTGGCTG GACTAATATA 5700

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760

CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTCGTGAAAA TGTTCTGAAT 5820

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTEGCGA TGCAATGTTA 5880

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 5940

AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 6000 

AACACCTATG TTTATGCCAG TTGGTACTAA AGCAACCGTT AAAACAATGA GTCCAGAAGA 6060

GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TATCATTTGT GGTTACAACC 6120

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 6240

AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6300 

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 6360

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420

GGCGAAAAGA TGTCTAGATG CACACCAAAG ACCTGAAGAT CAAGCATTGT TGGCATTAT 6480

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 6540

TTTTCCTGGT TATGCAATCG GCGGTTTGTC AGTTGGTGAA CCTAAACCGG TTATGTATAA 6600 

AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 6660

AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6720

CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 6780

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 6840

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTAATC AAGGCAGAGG AAACTTTTGG 6900

TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6960

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 7020

ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080 

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 7140

ACAAAAACTG TGCGAAACA 7159 
(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303:

TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA 60 

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA 120

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG 180 

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT 240

GATATGGCCA TTTCAATATA ATATCAGTTT GArAAC&AA GACTCTAATG TTGATTTAAT 300 

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA 360

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC 420

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA 480

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC 540

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT 600 

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC 660 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATcACTT ACGGCAGAAA 720

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA 780

TGCTTTTAAA AACCGAAACG TTACAGTTAA ATATGAAGTG AACTGGAAAA CACATGAAGT 840

AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCT2LAAAAA TCCTGGGACA 900

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 960

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 1020

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 1080 

TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 1140

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTGCAAT CGGTTACCTT 1200 

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 1260 

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 1320 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 1380

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 1440

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TGCCAATCC 1500 

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG GAAGCGATAT 1560 

AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 1620

TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTGATAT TAAAGATGCA 1680

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 1740

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 1800

AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 1860

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 1920 

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 2040

AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATH 1 2100 

TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160 

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAAGC GAATTTGAAA TTACTTACGG 2220

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA ACAGTTATTT 2280

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 2340

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 2400

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 2460

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCGA TGACAAAGTT ACTTTATACA 2520

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 

TCATCAAAGA TAAAAGTTAT GATAAAGATA CTTTAGTACT TAAAGCTACT GGGaATATTA 2640

ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2700

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 2760

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2820

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2880

AAACAATTAA TTATAAACAA GmAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2940

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 3000 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3060

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159 
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60

ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA TTTGCATTAT 120

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180 

ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 240

ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 300

AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 360 

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 420

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 480

CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 540

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGGGCGT TTTGATGTAA 600 

TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 660 

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 720

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATTAGCTA 780

AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG GATTTATTAT 840

TGTCTAAGCG TGTTGATGGT ACATTTAATG ATAGTCTGTC ATACTTGGAT TATAAAAAAC 900

AAAAACCTAA TGcTAAGATC AAAGCAATCA A7SGTAATGC TGAACAAAGT AGATCTGCAT 960 

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 1020

AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 1080

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACT 1140

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAGTTACATT TGTTTTAGGA 1200 

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260 

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 1320 

ATATTTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACGCTGA CAACCAATGG 1380

ACATTGGCAC CTGTAGTGGC TGCTATTATT GGTTTATCAT TAAATGTAGG TGCGTATGCT 1440

TCGGAAATTA TTCGTGGCGG TATTATTTCT ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500 

TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 1560

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620

GGATTTATTT TAGTGGCTGA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 1680

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATGTACT GGGTGGTATG CTTTATTATT 1740

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 1800 

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 1920

TACGTATGAT TAATGCATTA G\AATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980

CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 2040

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100 

TCACAGTTAA AAAGTTGAAA AAGGATGAGG CACGTGGGAA ATCACTCGAG TTACTTGAGA 2160 

AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2220 

AACGTGTTGC TATTGTCAAG AGCACTAGCA ATGAACCCTA AAGTGATGTT GTTTGATGAA 2280

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 2340

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 2400

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 2460

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520

TAATCAAATA GAACCACGTA TCATGTTTTv GTATGGCGAT GAAGCCATAT ACATGATGCG 2580

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 2640

GTACGTTTGT ACCTTATATA GTAATGGTTT GATAGGCTTA ATAAAGTCAC CGATATACTC 2700

TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG CCATAGTCTT CTGAGACTC 2760

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2820

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2940

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3060 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 3120 

CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAKTTT AACATGGCGT GGTCATCGTA 3180 

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 3240

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3360 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 3420 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 3480 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3540

TAAATATGGA TCTACAAG3A CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 3660

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTAAAT CAACTCTATG 3780

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3821 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120 

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180

CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 240

TAGTGAGTCC ATACTGTACC ATTTCCATCA TACTCAGCAC TGTCTGGTTT TAAACCATAA 300 

TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 360

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCGACA 420 

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 480 

TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACATT TATCGTCGAA 540

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 600

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 660 

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 720

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 780 

GTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 840

TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 900 

TTTTCTGAAT TATCATCTTC ATCATCAATT AAACTATAAC CAGCATTTGT AACATTAGTG 960 

TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 1020

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 1080

TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTETGATAT 1140

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 1200 

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 1260 

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 1320

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 1380

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 1422
(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 60 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240

AATTCCGATT ACACAAAAGG GACGATACAC ACTCGTTCAA TAGGCAACCT AAAAC&ATA 300 

GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360 

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420 

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 480

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 540

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCOT ATTGTAATTT TCCAACTACT 720 

CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 840

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900

TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 960 

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 1200 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTS.TTCG ATGACGTCTG 1320 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380



ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 1440

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACCGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560

AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCJTTG TCCATTGACC TCTACAATCA CAAACACGAC 1740

GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA COATCCAAA 1920

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC ACGTTCATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 2040

GCATGCCGTA ACAAATACCA AGTACTGGAA TTCCTAAATT ATATATTTCC GGATCAATTG 2100 

TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160 

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220 

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 2280

TTAACTCTTG TTCTTTTGCC ATTTCCATAT TTGTJGTTCT CCTTTATCTT AATTAGAATG 2340

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 2400

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 2460

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 2520

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2580 

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 2640

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700 

CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 2760

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2820

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 2880

AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTGC SCCGCTTCA AATAAATCTT 2940

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3000 

TAATATGTTT CACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCGATAACTA 3060 

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTGAAATAC 3120

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 3240

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 3300 

CTGGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 3360 

CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 3420

CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 3480

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTG& CGTTTTACTT 3540

TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 3600 

CTTGACGAGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 3660

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 3720

GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTTTTG 3780

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 3840

TTCACATTTT CTTCCGTTTG TTTATACTTT ATTCCATTAA AAAAGAAATT AAGAATAATT 3900 

GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3960 

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4020 

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4080

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGASTTT 4140

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4200

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4260

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4320

CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4380 

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4440

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4500

TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAG&C CTATCGTTCC TAAAACGAGT 4560

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4620

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 4740

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4800

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4860

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4920

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4980

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5040

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTBAGCA ACCTTGCAGA 5160 

GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220 

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 5280

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 5340

ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 5400

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 5460

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520

AGATTTTAGT AATCCCTTTA TCTTTAZATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640

TGACAACGCC GTCTTCCTTT ACTTTTTGTC CTAGTAACTC CACTCTTAAA ACCTCCTCAG 5700 

ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCAAACTAA 5760

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 5820 

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5880 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5940

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6060

ATCTTTTAAT AGTTTT 6076
(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60 

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120 

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATACAAC 240

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600 

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGC&GA CATTTTAACA AAGCTAAAGC 660 

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900 

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960 

TGTTGGTGCT AAAAATGCTT CCAAATATTC TGGTTTATTA GGCACGATAG TTATAGAAAA 1020

ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 1140

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTtCCCTAT GAATACCTTT 1260

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 1320 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 1380

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440

GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500 

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620

ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 1680

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGGGTGC GCAATTAACA BCCTTCAAG 1860

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 2040

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220

CCACCTGGTT GTAATACAAT TTTGCCAATT GTAnCAGCTA TAACAAGTAA ACATGCCCCT 2280

ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 2400

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACC 2460

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2580

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 2640

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700

GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760

ATACCACCTA GTATACTTAT TAACGGCAGT ACATAAACAA AGTTGTCAGC ATTAATTTTA 2820

CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2880

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2940

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3000 

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3060

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120 

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180 

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 3240

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3300 

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3360

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 3420 

TTAAACATAT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGT AAT GAT TC AC 3480 

CAAGATTTAA AATGGTAAGT TGTTTACTAA T T GT T AAT AT AATGAATAAC GCAATACCAA 35 4 0 

TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3600 

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3660 

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3720 

TAGATCGGCC TAT CAT T AAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3780 

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 38 4 0 

AAGCACCTGA AT T TAAACC T ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3900 

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3960 

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA T T AAAAAT AG 4 020 






CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAASAGTG 4 08 0 

TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 414 0 

TCTCTCTATT TGTCATGTTA ATTGTCCTTT T TAT CAT ATT AACTTACTTA ATTAAGAATA 4 200 

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4 2 60 

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4 32 0 

TGAACCTGTA GCGAT GAT AT CCCCTTCTTT C AT CGCAAT A AGATGATCTG AGAAACGAAT 4 380 

CGCTTGGTTG ATATCATGAA G AAC CAT G AC AATTGTACAA CCTTGTTCCT GATTTAGCTT 44 4 0 

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATA'CT AAATATGTTG TTGGTTCGTC 4 5 00 

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA AT C C AAAC AC GTTGTCTTTG 4 5 60 

ACCACCACTT AAATCATTGA TTGAACGGTG TCGGAATGTA TCAGTTCCTG TAACTTCCAT 4 620 

TGCCCAATCA ATtTCTTTCT TAt CCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 4 68 0 

AAAAC G AC C A TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4 74 0 

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4800 

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAGCAAAGTA 4 8 60 

GATTTCCCGC AGCCGTTAG3 ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4 920 

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4 98 0 

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 50 4 0 

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATSTCATG AACATCATTA 5100 

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 5160 

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 5220 

AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 52 8 0 

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 534 0 

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 54 00 

AG AC AT C T T T TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 54 60 

CAATTTGCTC GTCTATTGCT TTGGCi&GTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 5520 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT G C AAC AAC G A 5 64 0 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC ETGCAATTC 57 00 

TCGCCAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 57 60 

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 582 0 
ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5880

TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 5940

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 6000

CGCCTTCACC TAAGTTACCA GAAATAATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060

AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 6120

ATATTTATTA GACTAT 6136



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

GATATCGTAs CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60

TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTAATT 120

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 240

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 300

ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 360

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 420

GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 480

AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTJTAT TCGTAGAACA TTTGGGTTGG 540

ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 600

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 660

CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 720

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 840

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 960

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 1020

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080

TCAAtCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGfiCTGTTA ATGATTATTC 1140

CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200

CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320

TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 1440

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500

TTGCCACTAT GATGGTGTTA ATCACEATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560

CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 1620

CTTTTTTACA AGTAGCAAAA ACAGCGGTGG ATAGTTTTAA AGCATCTGAT AAATTATTAA 1680

TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 1740

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA ACaTTAGACT 2040

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100

CGTTAATGTT TGATTTTGAA AAAGCATACT G(ATACCGTT ATCTGCACAT ACAATATTAC 2160

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATG 2340

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATTAGCATCA CAGAATTTAA 2400

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 2460

TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60

TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120

TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGAETCT AAATCATATT GAATTTCAAG 180

TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300

ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420

TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 480

AACTTTATTT AAGAAGTAGC GATCATGGCT TACGGTAATC ACTGTACCAC CGAAAGTATG 540

AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600

CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660

ACCAGATA 668



(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTTCATCTA TTTATGTTGT TTGAAACATC 60 

GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG AT GGAAAAAT ATTCGATAAT 120 

CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180 

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 24 0 

TTTTTATTAA GCGTTAGGTT GAATGTATGA G AAT T T TAG A TTTATAATAG AAGATAGAAA 300 

CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 3 60 

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC C AAT AAT G AC 420 

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 4 80 

ATTTAGGGTT GAAC TAT AC A TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 54 0 

GCTTTAAAGT GGAATATACT TGAAAT AAT T AGTAGAGGTG AG T AAGG AT G AGTAATAAAT 600 

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTAGAC TTATTATCTA 6 60 

AACGTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720 

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780

TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 840

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900 

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960 

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 102 0 

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080 

GCCTAGGTGT TGAAGGACTT AAAATACTTA AAC AG AT T AA AGATAAATAT GATTTAAATG 114 0 

TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC T GAT GAG TAT TTAGACGTAT 1200 

TCCAAATTGG TGCACGTAAT ATGCAAAACT TCGAGTTATT AAAAGAAGCT GGCGTACGA 12 60 

AAAAGCCTAT TCTATTAAAA CGTGGTTTAT CTGCTACAAT CGAAGAGTTT GTTTATGCAG 132 0 

CTGAATACAT TGCTTCACAA GGTAATCAAA ACATTATTTT ATGTGAACGT GGAATCCGAA 1380 

CTTATGAAAA GGCGACACGT AACACTTTAG ATATTTCAGC AGTACCAATT TTAAAACAAG 14 4 0 

GTACACACTT ACCAGTCATG GTAGATGTTA CGCATAGTAC AGGTCGTAAA GATATCATGT 1500 

TACCAACTGC GAArsAgCAT TAGCAGTTGG TGCTGATGGA GTTATGGCTG AGGTGCATCC 15 60 

AGATCCATCT GTTGCACTTA GTGATGCGGG TCAACAAATG GATTTAGATG AATTCCAAGC 1620 

ATTTTATGAT GAATTAAAGC CTTTAGCTGA TTTAA.TAAC GCTAAAAAGT TAAAATAATA 168 0 

TTCCAAGGAA ACTATAGACT AC T T AAC T AA TATGTCATGT TGAAGTAGAA TATTATCTTT 17 4 0 

GAATCGACAA TTTTAAACTT ACAGCCATTC T AAGAG TATA TTACTTTTAG AGTGGCTATT 1800 

ATTTTTTGTA TAGAAATAAA GGTATACTGC ACTTAACGAT TGTTATAATA CTTCGACACT 18 60 

TGTTCAATTT CACAATTATT AAAGATTATG ACTGATAGCA GTAATTAAAA TTATAACTAT 192 0 

GAATTATCTG TAAAATATAA TAGATTCACA CATTTGTTGC TGAAATGTGA ACATTTTTCA 1980 

ACAAATGCAA TTGATATTTG AAAAGGCTTT CTCAAAACAT TACAATTAAA AATGAAAAAA 204 0 

GTTTATATAA AAT T AflAAT A TATCGTTCGT TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100 

TTTTCGCTAA ATTTTTCCAA ACAAAAATAT GTTACTGTAA ATTAAAATAT GGTAAACTAT 2160 

GAAAATGAAA TGAAAACATG T TAT TAT AAT GAATAAAACG TTTACAAGGA GGAAATTATG 2220 

ACAGTTACTA TATATGATGT AG C AAGAG AA GCGCGTGTCT (TATGGCCAC AGTGTCGCGT 2280 

GTTGTTAATG GGAACCAAAA TGTTAAAGCA GAAAC TAAAA ATAAAGTTAA CGAAGTCATT 2 34 0 

AAGCGTTTGA ATTATCGTCC AAATGCTGTT GCTAGAGGTT TAGC TAGTAA AAAGACAACA 24 00 

ACAGTAGGTG TGATCATTCC AGATATATCT AATATCTATT ATTCACAACT TGCTCGTGGA 2460

CTTGAAGATA TTGCAACAAT GTATAAATAT CACTCAATTA TTTCAAATTC AGATAACGAT 2 52 0 

CCTGAAAAGG AAAAAGAAAT T T T T AAT AAC TTATTAAGTA AACAGGTTGA TGGTATTATT 2 58 0 

TTCCTTGGTG GTACAATTAC TGAAGAAATG AAAGAATTGA TAAATCAATC ATCTGTACCT 2640

GTAGTAGTAT CAGGAACAAA TGGTAAGGAT GCACATATAG CATCAGTTAA TATTGATTTT 2700

ACTGAAGCTG CGAAAGAAAT TACGGGAGAA TTAATTGAAA AAGGCGCTAA ATCATTTGCT 27 60 

TTAGTAGGTG GAGAACATTC TAAAAAAGCT CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2820 

GTGTTAAATA AAAATGGCCT TCAATTAGGT GATACATTGA ATTGTTCTG TGCTGAAAGT 2 88 0 

TATAAAGAAG GCGTAAAAGC TTTTGCCAAA ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 2 94 0 

TGTATCAGCG ACGAAGAAGC AATTGGTATT ATGCATAGTG CAATGGATGC TGGTATTAAA 3000 

GTTCCAGAGG AATTACAAAT TATTAGTTTC AATAATACAC GAT TAG T T G A GATGGTTAGA 30 60 

CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 3120 

TTAACAAAAT AT AT G AAC G A TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 32 4 0 

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 3300 

TTACTTATTG AC GAG TAG AT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 33 60 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 3420 

TTTTAGGTAA TAAT TCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGSAAT 34 8 0 

AACCACTTTG AATTGGTTGA TCAATTAAAC T T AAGAAT AC ATGACTCCAT GCACGTGGTA 35 4 0 

CGACTCTCCA AAT AT T G TAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3 600 

AATCAGCTAA ATATTTTACA AAATATGGAA T T T C AT AT AA TGAATGTAAC G T AC AAT T T A 3660 

GAT GAG T TAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3720 

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA AC AC T C C AAA AATGATGCAT 37 80 

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 38 4 0 

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTC GGTTTCATGG ATAGAATAAG 3 900 

TAG T AAC AT G GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3960 

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4 020 

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4 080 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 414 0 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4 2 00 

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 4 2 60 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4 32 0 

GTACTATTTG TTCTGGAGAC AATAAATTTG CAT T C AAAAG TAGCTCTGTT GTTAATTTTA 4 380 

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCGATAT TCCAAAAATT 4500

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4 5 60 

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4 620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4 680 

C AAAT AC AT C TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4 74 0 

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4 800 

GTAATTGCCT AAAATTGATG CTGACTTCAA TTGCCCCCAA TTCGATTAAA TATGGAAGCT 4 8 60 

TACCTGTAGA CCAGCGCTCA ATTTCATAG GATAGTGGAA AGTGACATAA CCCACAATGA 4 920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4 980 

CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 5040 

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA AGTAATAAT 5100 

CTTCGGATTG ATACGTCTTT AAAT GAT T C A TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT AC T AGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AAT AAT TAG A AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5 34 0 

CCTTAAAGAT TAT G AAG AAA CAT AT AAT AC TTTTGATTGG AAAG AC G TAG AACAAGCATT 5400 

TTCTTGGAGT GAAACTGGAA AAAT GAAC AT G G C AT AT G AA TGCATAGATC GCCATGTAGA 5 4 60 

T CAAGGAT T A GGGGATAAAA TAGCGTTAAA TTA(AAAGAT GAGCACAGAA AAGAATCGTA 5 52 0 

TACT TAT AAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 5580 

AGAAGTTGAC AAAG G T G AC A GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 5 64 0 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATT TAT 5700 

GGAAAAGGCA GTTGCGGATA GAT T AG AG AA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 57 60 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5 820 

AGATGAGGAT G TAG AAG AC A ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 5 8 80 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 5 94 0 

TTCTACTGGG cAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000 

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGAT GATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGTTGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA EGTTAAATG GCGCTACAAA 6120 

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180 

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG T T AAT GAG T G CTGGTGACGA 62 4 0 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 6300

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360

GTGGATGACA GAAACAGGTG G AC AT AT GAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 64 20 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG AT G C AG G G AA 64 8 0 

TGAATTACCA CCAAATCGAA TGGSCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 65 4 0 

GTATCGTATC TGGAAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6600 

ATCTGGTGAT TCGGCATATA AAGATGAAGA TGGTTACTTC TGGTTCCAAG GACGTGTTGA 6660 

TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGACB CTAAATTGGT 6720 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 67 8 0 

AATAATTAAG GCGTTTGTTG C AC T GAG AAA AGGATATGAA CCAACAGACG AATTAAAAGA 68 4 0 

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6900 

ATT T AAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TAT T AAAAGC 6960 

TTGGGAATTA AAT T TAG AT G CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 7 020 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 7080 

TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 714 0 

AAAGCATTTG AAAAT GAT G A CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 7200 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 72 60 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAARGAA 7 32 0 

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 7 38 0 

AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA AT T AAAC AAA AAGGGGCGAA 74 4 0 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA AT C AAC AC T A CAACCAATTA 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7 5 60 

AAGCTAAAAT CGACATTAAT AAAAT TACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7 620 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 7 68 0 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTEC ATTAAGAGAG CCTGCTTTAG 77 4 0 

GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7 800 

TGGAAGATAT CAACTTACAT T T C AAC G G AG ATTTCCATGC GATTACAACT GCAAATAATG 7 8 60 

CATTGTCTGC GTTTATCGAT AATCATATTC AC CAAGGT AA CGAATTAGGA ATCGAT C AAA 7 92 0 

GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CAT G T AAAC G 7 98 0 

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8 04 0 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100

TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC T TAG T AC AAT 8 22 0 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 828 0 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATfflCGTT GTAACGGAAG 8 34 0 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAG 8400 

GATTTGATCC GGCAGCTGTC GTTGTTGTTG CGACAATTCG TGCGTTAAAA ATGCATGGTG 8 4 60 

GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520 

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 8 58 0 

ATGCATTTAT ACATGATACC GATGCAGAAG TAGAATATGT AAAATCTTGG GCTAAAGAAA 8 64 0 

ATAACGTACG AATTGCCTTA ACTGAAGTTT GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 8700 

TAGCAAATGA AGTATTAGAA GTCATTGYTC AACCTAATTC ATTTAAACCT TTATATGAAT 87 60 

TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8 820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AAT T AAAACA ATTTAAAGAA AATGGTTGGG 8 880 

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAACGTTGT 8 94 0 

TAGGTGCACC ATCAGGATTT GAAATTACAA TTCGTGAATT AG AAG C G AAA ACAGGTGCAG 9000 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAAC CAG 90 60 

C AGC AT T AAA CATGGATGTT ACT GAT GAT G GTCATGCAAT TGGGTTATTC TAATAAATCA 912 0 

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

TATTTCAGCA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 924 0 

ACAATGTTTG AAGGTGGTTA ATTAATGTAT AT G AAG AT AA AAAGTGGGCT TGAAGAATAG 9300 

GAAAGCGATG CAATGAATAT TCCATATTAA AAAffiATTAA TAAAATAGGT TGCAATATTT 9360 

AATTGGGATG CGCTACAATT AACACTAATA ATT GAT AT TG ATAATTATTA TCAATTAAAT 94 20 

ATAATCTTAT AGGAGTTGTT AACAACATGA AC AAAC AT C A CCCAAAATTA AGGTCTTTCT 94 8 0 

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCGGTCAT TGTCAGTACA CTATTTTTAA 954 0 

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 

AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

CTGctACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

ATGCAATTAA AGATCCTGCA T T AG AAAAT A AAG AAC AT G A TATAGGTCCA AGAGAACAAG 97 8 0 

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACGCAGTA CTATCACTTT TTCAGCATCA 98 4 0 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA G AC AT C AAT A 9900 

CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA CAAAAATTG CCAGTGAGAC 9960

TTGTATCATA TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 

ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 10140

CAGATACAAA TGATGCAGTA GTAACGAATG ATCAATCAAG TTCAGTCGCA AG T AAT C AAA 10200 

CAAACACGAA TACATCTAAT CAAAATATAT CAACGATCAA CAATGCTAAT AATCAACCGC 102 60 

AGGCAACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 10320 

CGTCAAGCCA ACCAGCTCAT GAMCAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 10380 

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AG TATCCACC AGCAGATGAA TCACTACAAG 10440 

ATGCAATTAA AAACCCGGCT ATCATCGATA Aa GAACAT AC AGCTGATAAT TGGCGAC CAA 10500 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAG AC AG T T CTATCATTK GCTAGTACTG 105 60 

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGGACCAAT AAT T G AAT T A GGTTTAAAGA 10 620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 10680 

TAGTATCATA TGATTCTGAT AAAGAT T AT G CCTATATTCG TTTCCCAGTA TCTAATGGTA 107 4 0 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACAT C CAT GAAGACTATG 10800 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 108 60 

AAGAAACATA C AAT T T AC AA AAATTATTAG CTCCGTATCA CAAAGC TAAA ACGTTAGAAA 10 920 

GACAAGTTTA TGAATTAGAA AAATTACAAG AGAAATTGCC AGAAAAATAT AAGGCGGAAT 10 98 0 

ATAAAAAGAA AT T AGAT CAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 11040 

CGGAATTTGA AAATGTtACA CCTACAAATG AT C AAT T AAC AGAT T T AC AA GAAGCGCATT 11100 

TTGTTGTTTT T G AAAG T G AA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGA&ATC 11160 

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGG AT GAC A 11220 

GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGAT C C T A 11280 

AAAATAATTC TAGAACGCTG ATTTTCCCAT AT AT AC C T G A CAAAGCAGTT T AC AAT G C G A 11340 

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTCA ATATCATGTC AG AAT T AT AA 114 00 

ATCAGGATAT CAATACAAAA GATGATGATA CAT C AC AAAA T AAC AC GAG T GAACCGCTAA 114 60 

AT G T AC AAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

CTGCAACAAA TCCT AAAGAT GCGTCTGATA AAGCAGAET GATAGAACCA GAGTCTGACG 11580 

TGGTTAAAGA TGCTGATAAT AAT AT TG ATA AAGATGTGCA ACATGATGTT GATCATTTAT 11640 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 117 00 

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 11820

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA G T G AAAC AAA 11880 

ATTATAATAA TACAGACAAA G T T AC T G AC A AAAAAACAAC TGAACATCTG CCGAGTGATA 1194 0 

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCACA CCATCGAAAG 12000 

AAAACAAACT TAGtCmATCT AAAATGCTAc CAAAAACTGG AGAAACAACT TCAAGCCAAT 12060 

CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 12120 

AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTGTA GAC 12173 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 60

CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 120

TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180

TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 240

GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG CGTTTCGGCT TCACCAAGTT 300

CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 360

AATCATTAGG GTTTGTAAAA TATATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420

GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 480

TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 540

TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAG ATTTCATTTT 600

TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660

GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 720

GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780

TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 840

ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900

TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTGCGTC 960

TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020

AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 1080

ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 1140

TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTGAT 1200

GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 1260

ATTGCCAACG TGCTTTCCAT GTACGTCGAC GATTCmTTAC GTTTTTACAA TATGGC 1316



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 60 

TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 12 0 

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AG AG AT G AAG 180 

CATTGGAAAT GCACAAACGT AATCAAGGTA AAT TAG AAG T TAAACCAAAT GTAAAAGTTA 24 0 

CTAATAAAGA GGAATTAAGT TTAGCATACT CiCCTGGCGT TGCTGAACCG TGTAAAGATA 300 

T T TAT G AAG A TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 3 60 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG AC C T G AAG C A AGTATTCCTG 4 20 

TAATGGAAGG T AAAGC AG T A TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATIG 4 80 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 54 0 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 600 

TGAAAAAAGA AACTAATATT CCGGTATTCC AT G AC GAT C A AC AT GGT ACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780 

G T G T AAG AAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 84 0 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 9 60 

T G T C AC AAG A TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 114 0 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 138 0 

ATTTACAATA AAAATAATTA ACGTTTAATT AAATTGCTTA AAAGTTAGCA ATAAAGAGTC 144 0 

ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA GAGTCAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 162 0 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 168 0 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 174 0 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 18 60 

aAGTTATTTA GAAAAAATTG AAAAGGACGi ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920 

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 198 0 

TAGAATGGGA AGTATGGGAT CGGTTATCGG T GAAAAG AT A TGTCGCATCA TTGATTACTG 2 04 0 

CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTAOCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA T TAT AT AT AT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2280 

TCGTCGAGTT ATTGAACAGA CAATAAACGA AAAATTGCCA GATGATTTCC AAACTGCAGA 2 34 0 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 2 4 00 

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 2 4 60 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAG AATCTCAAGA T AAAAAT GAT 2520 

GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAACGAGA AACTAAAAAA 2 58 0 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2 64 0 

ACGACCC T AG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2 7 00 

AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2 7 60 

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2 820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2 8 80 

AATCGTCCTA TCTTTACATT TATAGATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2 94 0 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3060

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTffiATCTCC TGAAGGTGCA 3120

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 318 0 

ACTGCCCATG ATATTAAGCA ATTAGGTATT ATAGATGATG TCATTTCTGA ACCACTTGGC 32 4 0 

GGTGCACATA AAG AT AT T G A ACAGCAAGCT TTAGCTATTA AATCAGCGTT TGTTGCACAG 3300 

TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360 

AG AAAT AT C G GTTCTTATAT AGAATAATCA ACTTGAGCAT TTTTATGTTA AATCGATACT 34 2 0 

GGGTTTTACC AT AAAT T G AA GTACATTAAA ACAATAATTT AAT AT T TAG A TACTGAATTT 34 80 

TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 354 0 

AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTTTGATGG AATGACATAT 3 60 0 

CAATTTGTGG T AAT T TAG AG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGTCATGAA 3 6 60 

GAAAATTGCA GTTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720

AGTTGTTCGT ACAGCAATTT AC AAT G AAAT TGAAGTTTAT GGTGTGTATC ATGGTTACCA 37 8 0 

AGGATTGTTA AATGATGATA TTCATAAACT T G AAT TAG G A TCAGTTGGGG ATACGATTCA 384 0 

GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3 900 

TAAAGTTGCA ATCGAAAACT TACGTAAAAG AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3960 

TGACGGTAGT TATCGCGGTG CACAACGCAT CAGTGAGGAA T G T AAAGAAA TTCAAACTAT 4 02 0 

CGGTATTCCT GGTACGATTG ACAATGATAT CAATGGTACT GATTTTACAA TTGGATTTGA 4 08 0 

CACAGCATTA AAT ACGAT T A TTGGCTTAGT CCACAAAATT AGAGATACTG CGTCAAGTCA 414 0 

CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATGGGC 4200 

TGGATTATCA GTTGGTGCTG AG AC AAT T G T AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4 2 60 

AATAGCTGAT AAAATTGAAC AAGGT AT TAA ACGTGGTAAG AAACACTCAA TCGTTCTGT 4 32 0 

AGCAGAAGGT TGTATGACTG CGCAAGATTG TCAAAAAGAA T T AT C AC AAT ACATCAATGT 4 38 0 

T GAT AAT AG A GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440 

TAG AG T T T T A GCATCACGTT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4 500 

AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCTTTTG ATGAAATTTT 4 5 60 

TGATGGTAAA GAT CAT AAAT TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 4 62 0 

ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA ATTGTATGTA CAATTGGACC 4 68 0 

AG C T T C AG AA TCAGAAGAAA TGAT TGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACG 4740 

ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AG AAT T GAT A CAATTCGTAA 4 800 

AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4 8 60 
TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920

TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCTCA GTAACATATG aAAACTTAAT 4 980 

TAACGATGTT CAAGTAGGTT CATACATTTT ACTTGATGAT GGCTTAATTG AATTACAAGT 504 0 

T AAAG AT AT T GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ATTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

AAAAGATGCT GAAG AT AT C C GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATATA TTGCGGAAAT 534 0 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 54 00 

TGAAAAAGTA CCAATGGTTC AAAAAG AT T T AATCAGACAA TGTAACAAAT TAGGTAAACC 54 60 

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACGTG CTACACGTGC 5520 

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 558 0 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 564 0 

ATCAGCTGAA GCAGCCCAAG AT T AC AAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700 

AACTTCATTA GTGAATGCTA TCGGTATTX GGTTGCACAT ACAGCTTTAA ACTTAAATGT 57 60 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 58 8 0 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATOTTAAA 5 94 0 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000 

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT AT GAT G AAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 612 0 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA T C T G AC AAAG TTATCGTTAC 618 0 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 62 4 0 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

TGTAGGTGTA GAAAAAGCTG T T AAAAAC AT AAGCASAAC ATGTTAGTTA CGATTGATGC 6360 

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAACGTACTA TAATTTATAA AAAAACGTCT 64 20 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 64 80 

AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 65 4 0 

TATACATGTA CAT TAT C AC C TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600 

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 

TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720 
ATCCCTAAAA AAACAGCA3T AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780

TATTTATACA ATACTTCGTA TTGAATGGTT CGCTTTCCTA GGGTGCCGTC TCAGCCTCGG 68 4 0 

tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTACGTAT TAACATGTAA 6900 

TTTTACTTTT A CAT ACT TT A AAAAAATAAA ACACTTTGCC CAETTGCAC ATAAATGTAA 6960 

AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7 02 0 

AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC ATGACCCAAC TTACACTACC 7 08 0 

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG CGATATGTTA ATGAAATTGT 714 0 

TGAAACGATA CCTGATAGCG AATTCGATGA ATTCAGACAT CATCGTGGCG CAACATCCTA 7 20 0 

TCATCCAAAA ATGATGTTAA AAATCATCTT ATATGCATAT ACTCAATCTG TATTTTCTGG 7260 

TCGTAGAATA GArAAATTAC TTCATGACAG TATTCGAATG ATGTGGTTAG CTCAAGATCA 7 32 0 

AACACCTTCT TATAAAACTA TTAAKGTTT TAG AG T G AAT CCTAATACTG ATGCGTTAAT 7 38 0 

TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 744 0 

TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7 500 

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAACATTATATCG 7 5 60 

TGACTTAGTT GAAGAAAAAA T AAT AC C AG A GATAAAAGAA GATGGAGATA GCGATTTAAC 7 62 0 

AATAGAAGAA ATAGATTTAA TTGGTAGTCA T T TAG AT AAA GAAATCGAAG ATTTAAATCA 7 68 0 

TTCTATTGAG AACGAAGATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 774 0 

GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AAT AT G AAG A 7800 

ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7 8 60 

ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7 92 0 

TTACAAATAG CGACAAATTC TCAAAAATGT TTATCCTAT GACCTATTTC AA 7972
(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 

ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATAT TAG TAGTCGTTGA 120 

CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTATGCTCA 180 
CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 240

TTATACTTTC AAG AG T T AAA TTAGAAATGT AT AT C AAGAC ACAAGTTGTA TCAGGCAAAT 300 

AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCGG AATCATAAAT CCTGTTGGCC 3 60 

CTTTTGGACC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 4 20 

TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 4 80 

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 54 0 

TTGCAGTAAT CGGTAATTCA AATACCATTT CATC&CTTC AGAACTTATT TGTAATATAT 600 

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 78 0 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 840 

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 960 

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 1020 

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 1080 

GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 114 0 

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATAACAATA TCTTCTGcAA 1200 

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 12 60 

aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 1380 

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 14 4 0 

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 15 60 

AAACCAACTG TAATTAAACG ATAT3TTTGA TCTTGTACAT CAAATGCAGT TGTATAAATC 162 0 

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 168 0 

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 17 4 0 

AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AAT T C AT AATG T ACC T C C T T 1800 

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT G AC AC T G AAG 18 60 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920 

AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 198 0 

ATCACTTTTT TCATGAATGC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 20 4 0 
ACACCGTCAT GTGAACCTAA AGAACAAACA GTTCCACGAT CAACGTATTC GAATTCTTCA 2100

GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 2160 

GCAATTTGTG CTGTAGTTGG TAATGGACGT TJTTCTCCAG CTGGGATAAA CGCTGAACAG 2220 

TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 228 0 

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 234 0 

GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTAC&AA 24 00 

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 24 60 

AAGTAGCTAA cTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2520 

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 258 0 

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2 64 0 

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 27 0 0 

TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AG CAT GAT CT 27 60 

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCA ATGCTACTAC TAAAATATCA 2820 

AAG T CAT AAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 28 8 0 

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2 94 0 

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC AT AG T GAT AT 3000 

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3060 

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 3120 

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175
(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120 

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG T AC AAG C T G A TGGTTTAAAT 180 

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TACACAAGA GATGCCACAT 24 0 

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300 
ACGGATGATA TTAAGATTGA AAGACCGCCA CAGCCAATAC AGAAGAACAA TACAGTGATA 360

TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 4 20 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 4 80 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAAC GAG AG 54 0 

AAAGAT T AC A AAGCTTATTT GGATAATAAA T C T AAAG AAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702
(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:

TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60 

AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATGArG TCCTGCTTGG 12 0 

ACGAATGCAA AGCC AAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 

GTTACGTAAA CTGCTAAAAG CGT TAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 24 0 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 360 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

GCGATACGTG AC AAC CATGC GTTCCTAATA GGATAAAGAA AC TAG AC C AG TAAGAACCAA 4 80 

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 54 0 

AG AC T AAAC C TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 6 60 

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TATCGCCAC 720 

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 84 0 

TTGTATCATG ACTCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 9 60 

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 102 0 

AAAATTAAGA AGAAACCGCC GATAGTCATA AAGATACCAA TCCAGAATCC AACAGGAGTA 108 0 
TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTGTCTA AGTAATGACG ACCATGTTCT 1140

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 12 00 

GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 12 60 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 132 0 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 1380 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 14 4 0 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 15 60 

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 168 0 

TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA fflGTAATACG TCCTTTATAT 17 4 0 

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 18 60 

ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACTTTG ATGTCCGAAT 1920

AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 1980 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAG C C C AAAG CATTGGCATA 204 0 

CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG G A£AG C AAG A ATAACAATTA ATGTAGTAAT AAATGTTGTT 2160 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2220 

AC AAAG AAG T TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2 28 0 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCAffi GTTTGTCCAA 2340 

CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 24 00 

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 24 60 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2 52 0 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2 58 0 

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 2 64 0 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 27 00 

TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAACC AATAACTAAG 27 60 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 282 0 

CATGGAAAAT TCATTAATGT CCACCTCCAT GATCATCATT GTCTTGATCT TGCGCATCTT 2880 
TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTETAT 2940

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 3000 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3 0 60 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 312 0 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 324 0 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3 30 0 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAQTT ACGACCTCTG AACGTACCAG 3360 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTGG AAT C C AG AAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 34 8 0 

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 354 0 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3 600 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGAT T A 3660 

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 372 0 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 37 8 0 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 384 0 

CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC AC TAAGT AAA ATTAGTGTGC 3 90 0 

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCtDCTAA T AC T ACAAAT 3960 

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4 02 0 

AAT T AAT AAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4 080 

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:

CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60 

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 12 0 

ATCACATATC CATGTATACA AAAGATGCAA AACGTAATAA GGATTTTTAC ACAAAT GTCC 18 0 

TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAATG TATCATTTGT 24 0 
TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300

CGGGTCATAA GCAGCCAGGT ACT G AAAC G A TTTATCGATT TTCATTATTA GTACCAAATC 360 

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAG TCTGAACGTT 4 20 

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA G AAAT CAT AT 4 80 

TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 54 0 

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AG AAAT G C AG 60 0 

CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AAT AAAT CAT 660 

TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 72 0 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 780

AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA AC AAC C AC AA AGTAATTCAG 84 0 

GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 900 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGTTGAACAA TTAGGAAGTC 960 

AATTGAACTT GCCTGACTTT TTAGAAGCAG AACGTGAACA AATTGAAAGT AAGTACACG 1020 

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 1080 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 1140 

TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 1200 

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 12 60 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTTC TTCAACGATT 1320 

ATTAGTGCAc AGATCCTGTA AGAGTATTTG AAGACTTCGC G AC AT TAG AT TTGATTTCTC 1380 

ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAtAAC AGGTATTTTT GACTTGTTTG 14 4 0 

GCTATGATTT AAAAG AC TAT GAT G AAT T G T TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 

TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AG AAAT AT G A 15 60 

AAATATTCCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 168 0 

GCCCAGCAAT G AAC T T T AAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA AC T GAAG C AG 17 4 0 

GTTTCGATgc TTCGCCTAAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

AAACAACTCA GGATGCTATG AG AG AAT T T T ATCCACATTT GAATACAGGG ATGTCATTTA 18 60 

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC G T C AG AT TAT CGAGAAGCGC 1920 

TAATGGTTGG AAGCCCGCAA C AAAT TAT T G AAAAGATATT GTATCAACAC GAGTTGTATG 198 0 

GTCATCAACG TTTTATGGCA CAGCTTGATT TTGGCGGTGT GCCffiTTGAA AATGTTATGA 20 4 0 
AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100

AGGAGGGGCG TCATCATGAA TATTGTATTA TTGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2220 

GAGTTGATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 22 8 0 

ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 234 0 

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTGCTTTGA AAAATGTGTT TGATCTACTT 2400 

CCAGTCAATG CGTTTCGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 2 4 60 

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AGCACATACG 2520 

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2 58 0 

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC EAAGTACAA 2 64 0 

CAACAAGTGT TTGAAGAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2 7 00 

TCGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGCCAATGA 27 60 

AC G AC GAT AA AGGCAAGTGA TGACAAGCAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 28 20 

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2880 

ATTAATAAGC AAATTGTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 2 94 0 

ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3000 

CGACCCTTGC TAGATGATTC TGATGAACGG AAA:GCATAA TTAAAATAAG TACGATGGTT 30 60 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 312 0 

CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACACCAGAC 318 0 

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAG 32 4 0 

GAAAATATCG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3300 

GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCGACA 3360

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 34 20 

AC AAT AC T G A AAA17ACGAC ACCTACATCG ATGGCTAATA GTTTACTACC TTGACCGATA 34 80 

ATATCGAATA TATTGAGTTT AAG T C CAT AT AGGATGATTG CAAATCTTAA TAAATATTTA 354 0 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3600 

ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTATCTAGGATTGG CAATTTAGCT 3660 

GCTAAAAAGC TAAATAATGC GAC TAT AAA T GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720 

TTTTTCAATG ATGCCATGAG CAGTGCCTCC TTTAATAGCA TTTTAGCACT GTTTTGTCGT 3780 

ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG TTGTGTGATA 3840

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTGGCG GGGCCCCAAC 3 900 
AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3960

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATGCAAGTT GGCGGGGCCC CAACATAGAG 4 020 

AATTTCGAAA AGAAATTCTA C2AGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4 08 0 

CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4 140 

TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATTTTA 4 200 

GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAA TTCACTTAGG 4 2 60 

GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTGC ATTTATTAAA 4 320 

AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4 380 

ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA T G C AC AG AAA AACTTCATTA 4 440 

GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGAT T A TGACACAAGC AGTTGAGGCA 4 500 

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4 5 60 

AATAATGAAA AAAGCCGGGC TATTCCTGAA AGGCTGGGAT TTACTAGAGA AGGTATGTTA 4 620 

CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4 680 

TCAGAATACG ACCAAAAATG ACAAATTAGA C T T AC AAAAG AGTGATGACA TTTAAAATGG 4 740 

CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4 800 

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTAAAA 4 8 60 

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATGCGAAAA TAATATTTAT 4 920 

AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4 980 

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA CGATAGTAAG 504 0 

AAGGATGATT CAAAG AC AT C GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100 

ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATTTTTAAA AGGTACAGAA 5160 

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 5220 

GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAfAAG CATTTGGAGA TTTAGATCCT 52 8 0 

AAAATTGATG CACGTCTTGC AGAT AT GAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 534 0 

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 54 00 

GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 54 60 

ATCACACCAA AAT TAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 5520 

AAAAT C AC AG GTGAAGAAGA AATTTATTCA CATACAGATT TATATGATTT TAAAGCGAAC 55 8 0 

GTTGAAGGCG CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 564 0 

AAATTAAGTG ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 57 00 
AAAGATAACA ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760

TTTGCGGATG CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GAT T AC T G AA 582 0 

TGACAAATTA TGAACAAGTT AACGATAGTA CGCAATTTTC AAGJ5GTACA TTTTTGAAAA 58 80 

TGTTAGGTAT TGGCGGTGCC GGTGTTGCAA TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 5 94 0 

CTTTCAAATC AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 6000 

GTAAAGTGCA ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 6060 

ATTTGAAGTC AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 612 0 

CTGATCGTAT GATGGATGGT GATACAGTTG GCAAGCCGAG TAACAATCCT TTAATGCCAC 618 0 

CAGTAGATAC CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 624 0 

TTAGTAAGTC TTTGATGAAG AAAATTGGGT TATCTAGTAA AATTCCCGAT GCCTTTAAAG 6300 

ATTTACCGCA TTTTCCGAAT GATCAGT T AA TAGACGATTA CAGCGATGGT GAT AT TAT G A 63 60 

TTCAAGCATG CTCAAATGAT TCGCAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 6420 

CATTTCGAGA TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCTCT GTAAAGGTA 64 80 

AGGAAACACC TAGAAATTTA ATGGCATTTA AAGAT GGAAC AATTAATCCT AGGAAGAATA 654 0 

ATCAACTTAA AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATTGCG 6600 

TTGTCAGACG TATTCAAATA C AC AT T GAAA CGTGGGATCG TACTGCGCTG GAAGAACAAG 6660 

AGGCTACATT TGGTCGGAAA C G AC AT AG T G GTGCACCGTT AACAGGTGGG AAAGAGTTTG 6720 

ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 67 8 0 

ATACGAGGCT AGCGAAAGAA GCAAATACGT CAATTTTACG TAGAGCCTTT AATTATGTGG 68 4 0 

ATGGTACGGA TGACCGCACA GGTAACTTCG AAKAGGCTT ACTTTTTATT GCTTTTCAAA 6900 

AAGCGACAAA ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 6960 

AATATATTAC ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7020 

GATACCTTGG TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTAG 1 7080 

GTGTGTAGCT TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 714 0 

TATAGTGTGA TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 7200 

AAGCAGAAAG CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 72 60 

AGTGAAAGTA ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGAT G C AAA AGCGAATGAT 7 320 

AATC AAAAAG ATACACTTTC GCAATTAACG AAG T CAT T AA TTGCTTATGA AGAGAAATTG 7 38 0 

GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 74 4 0 

GCTGCGATGA C AAAAG C G AT TAAAGATAAA AATAAAGCGGAATTAGAATC TTTGAACAAT 7 500 

AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 7 5 60 
TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 7620

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7680
AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGCGTTA 77 40 

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 7 8 00 

CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7 8 60 

TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7 920 
GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7 980 

AGTTATAGCT TCGTCGATGT GAT GAT TAT C TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 8040 

ATTGTAATGA CAT T GAC T AC CATGACGCGT AATGTAAAAG ATAAGAASG GACTGCAAGT 8100 
GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 
GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 8 220 

GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 82 80 

TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 834 0 

TTAGCGACGA TTGGTTTAAT ATCTGTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 8400 
ATGGGGATGA TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 84 60 

GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 8 52 0 
TATATATTTA GAGTGTTGTC GATCTTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 8580 
AGTATTCAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 8 64 0 

ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGTTAT 8700 
ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 87 60 

GTTTTTTTTG TTGCTATATT GGAAAT AT T C GGTATTGCAG TATAACGATA ATCACAGCAT 8820 
TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 8880 

ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8 94 0 
GGTGGTGCAA TGATACTTGC TAATACAAAG CAAGCGAAAT AAAT AT AT T T ACGrTAATGT 9000 
TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 90 60 

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGKTT CAATCAAATA TGCTTTAAAG 9120 

CCAATGACAG GCGAAATGTT TAAAGTTAAT GATAATTTTA ACGCGAATTG AATGATCATT 9180 

GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 92 4 0 

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 9300 

AATTGATAAA 9310 
(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 12 0 

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 240 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300 

TAAGAAAGGA GAAGAAAT T T TAAAGAAAAC AGAGTACTTAACCTCTTTTA TACAAAGTTA 360 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 4 20 

CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AG AAAAT AT G ACAATGGCGA 4 80 

GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC AC T AT AAAAG 54 0 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660 

AGATTTGAAC GAAATACAAA AC G AT C T G AA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 840 

TGTAGTGTCT AATGTAAAAA AAG AAAG TAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA T T GAT AAAG A AGAATTTAAA AAAGAAETG ATTATCTTTC 9 60 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 1020 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 1080 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA AT AT G T T G AA 114 0 

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200 

TCGATACACT TAAAGAATTT GGGTGTGGGT GTGTATATAG TGAAAGAATA AGTGGTGTCA 1260 

AACG TACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTGTCT 1320 

ATCAACTTGA TCGGTTAGGT AGAACGACm AACTATTaAT TGAATTaTCA CAATGATTCG 138 0 

ATGATAACGG AATTGACTTA CAAAT TAGT A ACATGAACAT TTCAACGAAA GACACAATGG 14 4 0 

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAG AAG T T AA TTTACTATGT 1500 
GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGGCCCCT 1560

CTTTACCAGA GAATAAAAAA T GAG AAA T T A AATTTTTATA TGATGAACAA ACGATAACAG 1620 

GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 1680 

AAATGAAAAA AC T TAT AC T A TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 17 4 0 

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 18 00 

TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATGCTTATGG TTGTATTAAC 18 60 

AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 1920 

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAAATAA GGTCACCGTC TTTTTAACGG 1980 

GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 204 0 

TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA 2100 

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAGCAGAAA ATACAATTAA 2160 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 2220 

AATTCACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 2280 

ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 2340 

AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 24 00 

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 24 60 

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2520 

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATIEAACGAT ACCCATTAAA 2 58 0 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC TCTTATGATA 2 64 0 

TTTTCTGCAA C AC T T T GAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 27 00 

GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 27 60 

GTTTTTTGAC CATT.TTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 2820 

TTAGTTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 2880 

ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 2 94 0 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTGAT 3000 

ACTGGAGAAA ATAATTTGTA TAG T AAT GAT GTCaGAAGTG AAAGAACGAT AATCATTGCT 30 60 

ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120 

AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATTTGTAATTCCG 318 0 

ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 32 4 0 

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3300 
TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 3360

GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 34 20 

TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 3458
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT T ATGAAT AT G TTCCAAAATC 60 

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120 

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180 

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 240 

AT CAT TT AC A ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTCTGGAcG 30 0 

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 3 60 

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 4 20 

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 4 80 

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 54 0 

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600 

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 6 60 

G AAT AT C AAA TTyCAAAATT AAGAATCTGA ATAAG 6 95 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 60 

AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120 

GCATTCAAGA AAT T T GAT AG TCATTTGCTA T C AAT T T C AG AAATATATTA TGAATTGEA 180 

AAT T T AAT AA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCGCCG 240 
AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 300

GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 360 

TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 420 

TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 4 80 

GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 54 0 

TGAAGTAACA GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 600 

TAATTTAACT ACTTCAAAAG CAT TAT AT AC TGCTTTATGT AATGAAGAAG GCGGTATTAT 6 60 

TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 7 20 

TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 7 80 

ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 840

AATTAATCAA TTAGTTGATG AAGAkGTAAC TGAAA 87 5 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 
TCTTTTTACG AAGGCTACCG TTTAGGATCT GAT AAC T T AA CTGTAGGAGA AATTGTATTT 60 

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 12 0 

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA AT AG AGAC G A TTTTGAATCA 180 

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 24 0 

AGACAGCAAA AGAC AC T T C A TGACGTTTTA GAATTGCATC AATTCCAGG GATTGCAGGT 300 

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 360 

ACTGATCGAA AAGAAGATAT T GAT C AAC T T GTCAAACATT TACAACAAGT AGAATTACCT 4 20 

AAAAACGAAG TAGAAATCGT TTCGACTAAA ACACCGTATG TTTCGACAGG TAAGGATCTA 4 80 

AGTGTCGTAC TTGTAGACTT TGGTAAGAAG C AAAAT AT T G TTCGAGAATT AAACGTCAGA 54 0 

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600 

CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGG\AAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 7 20 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA T T T AAAAT G A AGTTTGGTCA TCGTGGTGCG 780 
AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 840

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCATCTTGCATTA 900 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 9 60 

CATCCTGAAG CAAATCCAGG AC C G T C AG AT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 108 0 

AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 114 0 

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 12 60 

GTTAACTCAT GATTTTATAG CGCGAATTAT iCGTAAAGAG CAACCTGACG CTTTACTTCC 1320 

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 138 0 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 14 4 0 

CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTQBAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT C AAA GAG C AA GTGGGATACC CGCTAATTGT 15 60 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 162 0 

AC AT GAAAT C GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 168 0 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 17 4 0 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 18 00 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 18 60 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGA ATGTTCAATT AGCATTAGAT 1920 

CCCCATTCAT TCGATTATTA TAT T AT AG AA GTAAATCCGC GTGTATCACG TTCATCAGCG 198 0 

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 204 0 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160 

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 222 0 

TACGAAGAAT CATTGTTAAA AGCAATTCGA TCACTTGAGT ATGGTGTGCA TCACTTAGGA 22 80 

TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 234 0 

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 24 00 

AT AAT AT GAC T C AG AT T GAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 2 4 60 

AGCATCAATT AAAAGAGCAT CAAGGTGATT TAG AAT AT C T TAAATSGCA AAAGATTATG 252 0 

GAT T TAG T G A TAAAACAATA GCGCATCGCT T T AAT AT GAC GGAAGAAGAA GTATATCAAT 2 58 0 

TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 2 64 0 
TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 27 60 

AATTTGACTA T GC G AC AG T T CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT C AAC AG AC T T CTCAATTTCT GACAAATTAT 28 80 

ACTTTGAACC TTTAACTGAA GAAGATGEA TGAATATCAT TAATTTAGAA AAACCTAAAG 2 94 0 

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3000 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 30 60 

AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACfiCTACAT 3120 

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAAT CGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 32 4 0 

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3300 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 34 20 

CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATAC TATA AAATTAGCTA 3480 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAffTCGT TATAGCTCAC GATGGTGTGT 35 4 0 

ATGTTTTAGA AGT AAAT CCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA AC AGAT AT GG 3 6 60 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 372 0 

GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 3780 

AAGTGATGGG GAAAGATACT AC AT T AGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 38 4 0 

GCGTTGAAGT T AAAGAT C AC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900 

AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3 9 60 

GAAcAGC T AA TAAATTAGCT GAG TAT G AC A TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4 020 

GCGAAAATGA TTTATTAACA CGTATTCAAA ATGGTGATGT TCAAATCGTT ATAAATACAA 4 08 0 

TGACTAAAGG TAAAGAAGTA GAAAGGGATG GCTTCCAAAT TSACGTACT AC AG T T G AAA 414 0 

ATGGTATTCC ATGTTTGACA T C T T T AGAT A CAGCTAATGC CTTAACGAAT GTAATTGAAA 4200 

GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4 2 60 

TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4 320 

TAAAGGGAGA CaTATATGAT G AAAGAT T T A CCAATTATTG CATTAGATTT TGAATCAAAA 4 380 

GAAAAAGTAA ATCAATTTTT AGAT T TAT T T GATGAATCAT TATTCGTAAA AGTAGGTATG 4 4 40 
GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 4500

GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4 560 
CTAGCTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAAT G AT G 4 620 

TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4 680 

CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACAAACATCGATT 4 74 0 
GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4 8 00 

TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4 8 60 

AC AC C AGG T A TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4 920 

GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT T AC AC AAAG T 4 980 

GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 50 4 0 

AAATTGCAAA AT CAT TAT T A GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100 

CATGGAGTTC AGGT AT TAAA TCACCGATTT KTGTGATAA CCGTGTTACG TTAGGTTATC 5160 

CTTTAGTTCG AGGCGCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220 

AAGTAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 52 80 

AAAAATTAAA ATTACCAATG AATTATGTTC GTT CATC AAA TAAGAGTCAT GGTAAG5AA 534 0 

ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 5 4 00 

CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 54 60 

TAGGTGTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 5520 

TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAmTG 5580 

AAGGTAAAAT TTCTAGTGAA GAT AT C C AAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 564 0 

AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 57 00 

TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATT© AATGATTCCA ATCTATCTAG 57 60 

TAGTCACGAT GTCGACGAAA GTTCAACGAA G C AAAAAC AT ATAAAAAATA AAACAACTAT 5 8 20 

AGATCATAAT GATGATTTAT TTAAACATGT AAAGGATATA TTACGTAAAC AAGGACAAAT 58 80 

TTAAAAACAA AAAAGCT 58 97 
(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:
TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 60

TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 120 

GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 240 

GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 300 

TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC T GAAAT T AC A 3 60 

TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA STCCGGATA ATGTGCCTTA 4 20 

ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 4 80 

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 540 

AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 600

kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAt CAAG CTATTTTATA 6 60 

GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 720 

CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 7 80 

TTATCGATCT CAAATCCTTG ACOVTAATCA ATCACGTCAA TATATAGTAT ATCGTTCATT 84 0 

TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 

TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 9 60 

ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAG ATGAATTAAA 102 0 

CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080 

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 1140 

CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 12 00 

AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260 

AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 1320 

TCTTCATCGC TATATATTTC AGTATTTGGA GACGCTACAT TTAAAATGCC GAATTGTTCT 1380 

TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 14 4 0 

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 1500 

CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 15 60 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCSTTG 1620 

ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAAT AT TT TAACGCACCT 168 0 

TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 174 0 

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 
TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860

TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 192 0 

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 198 0 

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAA TATGTCTAAA GATATTACTA 204 0 

TTAATGATAA ACCTGCGCGA TTAATGGATA AAGTGCATAC TGGCGACGAT GTATTTGTTC 2100 

CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 2160 

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 2 22 0 

TAAAAGAAAG CAATACTTTA AT G AAT CAT G TGATTTACAC TATTGATAGT GACTATGTCG 2280 

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 2 34 0 

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 2 4 00 

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 24 60 

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 2520 

TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2 58 0 

G AC G T AC T C A TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCTGTT ATTGGTGATC 2 64 0 

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGT TACAAA ATAGAGTTTG 2700 

TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 27 60 

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2 8 20 

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 28 80 

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2 94 0 

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT AC AT G AC AT C ATAAGATTTT 3000 

TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 3060 

TAATAAAACG G AC T C CAT G A AAAGTTTTTC CTTTTCATGG CTCTATATCA AATCAGACTT 3120 

ATAAGTTCAA TTTGACATAG ACAAAAAATA AAGCTTGAAC ACTACATCTA CCTGTAATGC 318 0 

CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TEGGTTTAA 32 4 0 

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 3300 

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 33 60 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 3420 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 34 80 

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 35 4 0 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 3 600 

TAACATTTCA CATTGTGTAG GATTAACTTT ACC&GCATA ATTGATGAAC CTGGTTCATT 3 6 60 
TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3720

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 37 8 0 

TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 38 4 0 

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3 9 00 

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 39 60 

TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4020 

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 4080 

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 4140 

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4 2 00 

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTAAATACA TATTAGCAAC 4260

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4320 

AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4 380 

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440

CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500 

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4 5 60 

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4 62 0 

TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT TATGATTAAA ACGTTTTCAT 4 680 

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT T TAT T AC AG T GATACGGTCA 4740 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4 8 00 

CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAAffi CAGCACTAAC 4 860 

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4 920 

AAACCCATCA AGTGTATAAT ACAATATTGC TGTTATTAAT AAACAACAAA T T AC AC T GAT 4 98 0 

TACAATACCT GAT T GAT T AC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 504 0 

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 5100 

AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 5160 

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 5220 

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAAT AT T AC 5280 

T GAT AG AT G A CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 5340 

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 54 00 

TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGCTT 5 4 60 
CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 5520

CCCAATTTCT AC AG AC AT T G CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 55 8 0 

AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 564 0 

AATTTCTACA GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 57 00 

TCAGCTTAAA AT AAC AT G C A AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 57 60 

TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820 

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 58 8 0 

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 594 0 

TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6000 

T GAT C AG AG A TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 6060 

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 6120 

ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 6180 

AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 62 4 0 

GCTGGTTCGA AGCTTGTTTT ACCATTTTCT TGATATAATT T T T C AAC T AC ATCTCGTTTA 6300 

TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 63 60 

ACATTTAACT TACCTTGCTC AT C GAT AG AA ACAACTTGTA ATGTTAATTT CCCTTTATTT 64 20 

TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTCATA AGCAATTGGA 64 8 0 

TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 65 4 0 

AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATTCATCT 6600 

AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6660 

AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 67 2 0 

TTAAAGTTCA TTGCCATGAT TTCTACACTC CTCTTAGTTT GCATTTTACA TAAAAACGGC 67 80 

GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC 68 4 0 

TTATACTCTA AATATAGCAC TTAAGCSTCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6900 

TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT T AAC AAC GC A GCAGTATTTG 69 60 

ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT 7020 

TCTCTTTCAC CCAGTCTGGT AATCCTGTCG TTTCTTTTCC AAAAATGAAG TAAAATCTT 7080 

TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 7140 

AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT AT CAT GAT AC GTAATATTCA 72 00 

CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT AT CAT C AG T T CTAAATCCAA 7 2 60 

GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7 320 
ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTT GTCATAT TAC TATTCTCTCC 7380

TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA T C AT AAT TAG 74 4 0 

CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTGACC AATGGCCCAA TAAGCAGTTG 7 500 

CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA ATTGCATCCA 7 5 60 

CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 7 62 0 

AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT AT TAG AC AT A CGTAGtAAAG 7 680 

GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT TCGGTATTAA 774 0 

TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT CTATTCCCAA 7800 

TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7 8 60 

TCTGACTATT TAATTG3CCA TTTCCAACTA ATGCACTTGT TGGACAACGA T C AAC AC AAA 7 92 0 

TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7 9 65 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG G TAC AAT C G T 60 

GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACAT'EGT AC G AC G G C AA TTTCTATTAT 120 

ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180 

G AAT AT ACT A ACGTTAGGTG CAT T AAC AGT AGCGATTGGA CGTGTGATAG ACGATTCGAT 24 0 

TGTAGTTGTT GAAAAT AT T T ATCGACGCTT AAC AG AT T C A GAAGAACAAC TAAAAGGTGA 300 

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT C G AC AC TAG T 3 60 

TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420 

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT C AAT TAC ACT 4 80 

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 54 0 

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GT ATT AC ATT GGTCATTAAA 600 

TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660 

AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAATTTT TAGCTATTAC 720 

TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780 
AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840

TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGTTGAAT ATGATAATGA 900 

CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 9 60 

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 102 0 

TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 108 0 

AATGAAACAG GTTAAAGGAC TAGCCiATGT CAAATCTGAT TTATCGCAAA CAT AT GAT C A 114 0 

GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 12 00 

TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 12 60 

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GC 1302 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 60 

TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 120 

TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180 

TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 24 0 

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 300 

CTGCGTTAAC TAAGATTGCA CCTTTTTTCA CATGATCAAA CATTGCTTTA TCGAATAAAT 3 60 

GATAGCTTTC TTTGTTCGCT GGAACATG'R AAGAAATAAT ATCGGCATCT TTAATAGCTT 420 

CTTTAACACT ATCTTTATAA GTTAAAAAGT CTAAATCTTT ATTAGGATAG GCGTCATAAG 4 80 

CTGTAATTGT AGCACCAAAT CCTGCATATA TtTTAGCTGT AGCAGCACCG ATACGACCCG 540 

TACCAATAAT TGCAACAGTC ATATTTTTAA CTGGTTTAGA CATGATTTCT GCTGCCAAG 600 

TAAAATCATG TGCTTGTACA CGGCGTTCAA TATCTGGGAA GCGACGCACT AATTGTAGGG 660 

CGATAGArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 7 20 

CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCAGCA GTACGTTGTG 7 80 

CAATTTGTTT AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 84 0 

ATTGCATTGT AGTTACGCCA TCGTAATCTT TTAATTGAtC GACTGTAGCA CTTGATAATA 900 

GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 9 60 
TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCCTCAT TAT 1003
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

GGCTTTTATA GTGTTTACGA TTATTATGAT TACTTATATT CACGCTGTTA AAAATTATCC 60 

AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 12 0 

TATCACAGGT GCATTATCTA T T AT G AC AAA TGTTAACCTG ATAATCGCAC TTTTCCEGC 18 0 

ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 24 0 

AGTAAGAAGT GACArGCAAT AACaAAAAAG cmAACCGTAA TTTTAATGGC ACGCCCATTA 300 

AAATTACGGT nTTTTATATC AATATT TAAA AtTAAACCtA AG C CAT G T AA AAACGAGATT 3 60 

ACACGTCAAT TGTTGTGTAA TCTCGTTTTA T n T T AAT CAT TTTAGTCAGT TGCTTTTTCA 4 20 

ATTTCGATTA ATAAATCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 4 80 

ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AGTAATTAGC 54 0 

AACGGCTGAT TAGCTTTCAC AGTTTCACCT AC AC TAACCT TGACTTCAGT TACTGAACCT 600 

GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 660 

TTTGTATGCA CATTTTCATC TTTAATGTAA ATACGTCTCG CTTGACCATT CATCGCATAG 720 

TAAATCGTTC TATTACCATT TTCATCTGGT TCACTAATCG TTTCTAGTTT AATAATTAAT 780 

CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 84 0 

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960 

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAATCTTT ATTAAAACCA 108 0 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 114 0 

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTECAT ATAAAGTGCC 12 00 

AT AT C AC C AA C T AC T T TAG A CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 12 60 

ACTCTGCGAT AC AT AT C T T T GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320 

TGTTGACTTA AATTCGAATA CTGTCCACCA GGCATTTCAT GTTGATAAAT TTCAGTATTC 138 0 
GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 1440

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 15 60 

TCAATGATAT CGACACCAGC ATCTATTCET TGTTTGTATG TTAATAAACC ATTACCACTT 162 0 

GTATCATGAG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 17 4 0 

CCTTCACGTT CTAACTCTTT AGCTAGTTTG ACATAATACT CTAAAGTATA AACFTTTGAT 1800 

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 18 60 

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 192 0 

CTAAAGACAT CTATGCCTGC TTTAGCACTT TCTTGTACGA ATTTATGAAT AAC AT TAT C A 198 0 

GGATAGTTTT TATAACCAAC TGCGTTTGAA GC AC G T AAC A ACATTTGGAA TAATACATTT 2 04 0 

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AACCATCTTT AAATACGTCC 2160 

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTCTAA CTCGTGTAGC TAATAATGAT 2220 

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 22 8 0 

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 2 34 0 

ATTTTACTTG AAGAT AC AG T TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 2400 

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 2 4 60 

CTATCTAGAG ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520 

TAATCACCAC TTGTGAACTT TTTATTCTTC ATTACATTAA TTAAGAATGG AATATTAGTT 2580 

TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 2 64 0 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 2 7 00 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2 7 60 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GZECTTCTGT TGTAATACGA 2820 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880 

CCAAATAAAT CGGCACCTGC T GC AAC T AAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2 94 0 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3 000 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CAT AT T T AAT ATTTTCCATC 3 0 60 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 312 0 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180 

ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 32 4 0 
TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 3300

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 33 60 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTCTTTTGCTAAT 34 20 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 34 8 0 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 35 4 0 

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 3600 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3660 

TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3720 

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 37 8 0 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTCTCCAC GGTTAGCAAC AAGTAACTTT 38 4 0 

TTTATTTGTT TCAATAGCGA TACACTCCTC AAAC TAT TAG AATTTTCTAA CTAATTAGAT 3 90 0 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 3 960 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATG&TA 4 020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4 08 0 

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 414 0 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4 200 

AT AAT TAT AC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4 2 60 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4 32 0 

CCAATTTGAC AAAG AG TATA TAAATAGCCA ATGTTTAGCA TTGCTTAATG ACAGAAGTCA 4 38 0 

TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAAT GTCTTCTTAT ATCAACTTTT 44 40 

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4 50 0 

CCCATAGCAA T AC T T AAAC T AATCATTGAT GATCCACCAA AGCTGATAAA TGGCAATGGC 4 560 

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT T T AC AAAC G T TTGACTTCCA 4 620 

AAGTATGTGG CAATCCCAAC AC AC AC AAG T T T AT AAAAAT ATGATGATGT TTTATTTGCA 4 680 

AACTGGAAGG C AC GAT AT AC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 474 0 

AAAC C T AAT T CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4 80 0 

CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4 8 60 

AG C AAT G AAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4 920 

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4 980 

ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTGTT 5030 
(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60 

ATTACATTGC CAGATGTTTA CGGCGAAGTG AC AAGAAG T A AAATGATAGT TGTCGAAAGT 120 

TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180

TTGCATATTA TAG AT C AAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 240 

ACAGATAAAG AAGTGGAGGC AT AT T T T AT A AAT G AC T AAA ATAATATTTA TGGGTACACC 3 00 

AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 3 60 

AACGCAACCA GATCGACCTG TTG5ACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 4 20 

AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 4 80 

ATTAGAACAA TTGCTTCAAT TAG AT G TAG A TTTAATTGTA ACTGCTGCTT TTGGACAATT 54 0 

ATTACCTGAA TCATTGTTGG CAT T AC C AAA TCTTGGGGCA ATTAATGTB ATGCATCATT 60 0 

GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 6 60 

AACCGGCATA AC AAT TAT GT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 7 20 

ACAAGCAATT AAAATAGAAG AAAATGATAA TGTCGGTACG ATGCATGATA AATTAAGTGT 7 80 

ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 8 40 

CGTACCTCAA GAT GAT AC GC AAGCAAC AT T TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AAT AAAC C AG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TAT C AC CAT G 9 60 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA AT AT AC GAT G CTGAACTCGT 1020 

T GAG AC T AAT AAGATAAACG AGCCTGGAAC CAT T AT AGAA ACGACTAAAA AAGCCATTAT 10 8 0 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT AT GC AAT TAG CTGGGAAAAA 114 0 

GAG AATGT T A GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAG&ACT 12 00 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT AT T AAAT G AA 12 60 

GGTGCGTATA GTAACTTGCG TAT C AAT G AA GTGTTGTCAG AAAATGAATT AAATGCAATG 1320 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 138 0 

TTTTATTTA 1389 
(2) INFORMATION FOR SEQ ID NO: 326:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60 

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120 

GCTAAAGCAT TAGATAAACA AAGTAATGCA AA&CTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG AT AATAGT T A CTATTATCCA GTTATAAAAG 24 0 

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA T AAAG AT GAT TTAAACAAAT 300 

CAAAAGAAGA CAT GAAT T AC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GAT T TAG AC 3 60 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 4 20 

aAGAAGATGG cmAAGTTAGA T TAG T AAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 4 80 

AAGAAAG T GC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 54 0 

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 600 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 720 

GTACATTATA CCCTGAAGTA AGTGAGCAAG ACCTTCCTAATTGTGCAACA TTCCCTAATC 780 

AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 84 0 

CATATGAACA AGTTGATCAA C T T AC AAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 90 0 

GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 960

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 102 0 

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 1080 

GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 114 0 

TATCAACTAC AAT TTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 12 00 

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 12 60 

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 132 0 

CATCAACGCA TTAAATTTTC AT C TAT T GAT AAT G AAGAAA TCATTTAffiT TCTTGATTAT 138 0 

GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 14 4 0 

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 1500 
TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 1560

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 1620 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 168 0 

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 174 0 

CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 1800 

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 18 60 

CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920 

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATHAAAG 1980 

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2 04 0 

CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 222 0 

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 228 0 

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 2 34 0 

ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA AT CAT T T C AG 24 00 

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 2 4 60 

TCCCTTCGTA AAAT T CAT AT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 2 52 0 

TAGTCATTTT GCTCCTCCTT TAAAAATCCT AATATCATTG TATCAAATTC ATCACTATCy 2580 

TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AG GAAT T AAA 2 64 0 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2 7 00 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 27 4 6 
(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 60 

AGTTTGAATT T T AAAAGC AT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA G AAAT TC ATT 12 0 

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 180 

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAAACTATGTCAC AAG AT G T AAA 24 0 
TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 300

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 360 

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 420

ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 480 

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 54 0 

TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 600 

ATTGAAAGAA AAATTAAAAC AG2CAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 6 60 

ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 720 

TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG ACCGCTTTAT 78 0 

AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAfiA GAG AAAAAT C 84 0 

TCCATTAGAA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900 
(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60 

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG cTTTATGcCC tTCtATTAAA CtCGGaACtA 120 

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180 

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 240 

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 30 0 

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 3 60 

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACACCAAT AATTAGTCCA AACTTCAATT 420 

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 480 

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTGTAATGTTATACA 54 0 

TACACTTATC CTTCATTATA CCCGAACTTT T TAT AT T AAA AGCAAATTTA TGGAAAATGC 600 

AAT TAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660 

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TAT TAT T T AA CTTCGCCTGT 72 0 

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 7 80 
TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 840

GAT AT AT T T C TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 900 

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA T^ATTATCT CCAAATAGCA TGACAAGTTT 960 

ATGATCCTTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080 

CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTTC 114 0 

AGCATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG cTTGTACCCA 1200 

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 12 60 

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320 

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 1380 

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 14 4 0 

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500 

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 15 60 

TGCTATTGCA ATATACTTTG AAATTTTATT CATCTTATCA CCTCATGATT AAT AT T T AAA 162 0 

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA T AC TAT G AAA AGATATTTAA 168 0 

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 1740

GAG AC C T AT C ACATGAGAGA GATTTGTGAT TTATCaCGTA ATAAACTTAA ATTCATTTGA 18 00 

GTGACTAGGC CTTCCTATCA GAC AT AT T C A CTCATCCACG TATCATTATG TGTACAGTGT 18 60 

GCTATCTCTT ATTTACCTAT TGGAACAACC AT AAAC T CAT CCATAGTTTA CCTTTTATAA 1920 

ATAGCAGTCC TCACTCATAC 2ATTTCTCAT AAAAAT C AC A ACGCTCCAAC GTATTTCCAA 198 0 

CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACTTTA ATTGTCATTA TCCAATAATA 2 04 0 

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TTGTCATTTA 2100 

AAGATAAAGT AAGAACGTTT TTATTTTTCA GATTTTTTAA ATTATT35GA ATATCTAGTT 2160 

TTAGGAAGGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2220 

TATTTTTTGG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 22 8 0 

ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 234 0 

TTATTGTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTGT AGGCAATCGT ATACATCCAT 2 4 00 

GGTTCGGGTT TATTTTTGCT GTCGTGATTT ACATGTCTAT CGGAGCATTT TACGGTATTC 24 60 

CACGTGCTGC AAATGTCGCG TACGAAATTG G T AC AAG AC A CATTTTACCT GTGCATAACC 2520 

AATGGACTTT AAT T AT AT TC GCAGCAATCT TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2580 

ATCCATCGAA AATCGTTGAT AATTTAGGTA AAT TAT T AAC ACCGTTATTA CTATTAATGG 2 64 0 
TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTGAATC TGCACTAAGT GCACCTAAGG 2700

ATAAATATAT AACACATCCT TTCATTTCAG GAAGTTTGGA AGGCTATTTT ACAAGGATC 2760 

TTGTTGCTGC GTTAGCTTTT TCCGTAGTCA TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820 

CAGATCGCAT GAAAATTTTA AAATATGTCT GCTTTTCAGG TCTTATTGCA GCCATATTAC 28 80 

TTGGAATGAT TTACTTTGCA CTTGCATACG TTGGGGCATC AACAGCTCCA GGAAACTTTA 2 94 0 

AAGATGGTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTTCG TTCGGTAACC 3000 

TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATGTATAGGA CTCGTcAATG 30 60 

tTGCGCCACA TTTACTAAGA AACACGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 3120 

TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTA GAAATGATTT TAAAAATTGC 3180 

TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 32 4 0 

TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG T TAT T AC AT T 3300 

GAT TAT T T C A ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTG TTATTTTGAA 3 3 60 

ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 34 2 0 

CTTTGCTATT ATCGGTTTCA TAATCGATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 34 80 

TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTttATATGA 3540 

nATTCATGAC TTTTCAGTGA TACCGTCTGT C TAT T GAT T A TnTTAAAATA AATGATTTGT 3600 

TAAAAAATCC TAGCTGTTAT T C AAAAAT AT TAGTTTTTAA AA 3642 



(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

T TAT T GAT AT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 60 

ATCAAACTGA CACTCATTTT GAAC AG TATA AGAAATTTff TTTAGATTCA GG TAT AT CAT 12 0 

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT T AAAT AT AC T AATGATGGTA 18 0 

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 240 

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300 

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 3 60 

AT AT TGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 4 20 
TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 480

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 54 0 

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600 

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 6 60 

CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTAAA CCAAGC T AAA 720 

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG C TAT T TAT AT CATAAGGAGA 780 

TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840 

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900 

TATAATGATG GTCTTACTTT AT AT AC T AAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 102 0 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080 

TGGGAAACAC AAGACAAAAG CTTTTTCSST TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 114 0 

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 12 00 

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 12 60 

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATCAATT 132 0 

TGCAATGTAT TGTTTTTTAG ACACACTTTA TTCACATTTA ACGTGATTAG GGTGTGTCTT 138 0 

TTTAAATTAA TTTTATGTAA TATAAATACT GCATTTGCAA ACTGTTGCAC TTTTAGGTAT 14 4 0 

AACAGAATTA ACTACATTTA AGGAGAT TGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 1500 

CTTGGTATTA GTTTAATCGC CATCATGTTA ATTGTATGTA TTGTATTAGT AAT CAT GAT G 1560 

ATGACTGGCG GAAAGAAAGA TACATACTAT GGAAT TATGA AAGATAATAC TACTATTGAA 162 0 

AAAAT GAT T A G T GAAAAAG A TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 168 0 

GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTATA AATTAGCAGA T T C AG AT AAA 17 4 0 

ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 1800 

CAT G AC AT GG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 18 60 

GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 192 0 

TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 198 0 

TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 204 0 

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 2100 

CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 2160 

CCTGCGCTTT AnCATGGCCA TAAAAGC 218 7 
(2) INFORMATION FOR SEQ ID NO: 330:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60 

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTSA GTATATTGCT TTTTGTTTTT 120 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 24 0 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT T AC AG AT TAT AAACCATAAC 300 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 3 60 

AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 4 20 

GATAGCTATT ACATTAAATG CCTATACCCC AG AC C T C AAA CACCTTTTTA TACAGGACGC 4 80 

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 540 

GCAATATCTT GTGGTTCAAG TTTTTTACGA TCTGTTGGAT TGTATGCGGC AGTTATGGCT 600 

GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTffiAAAT CGTACTACTT 720 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780 

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 840 

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900 

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 960 

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020 

TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AG AC AAC T T T TGCCCCTTCT 1080 

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 114 0 

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 1200 

GTAAATCATA AAT AC AT AT T ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 12 60 

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TfiAATTAAT 1320 

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 138 0 

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 14 4 0 

GAGGAATAAA AC g T G AC AT A TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500 
ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 1620

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 1680

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTGAAAA TGGAAGGATT CTAATATCAT 1740

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 1788



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

TAAGCCAAAT CCATTGTTAC C AG AAGAAG T TCGCTTCATC TTAAAACAAA TGGGTCTTAA 6 0 

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCQGTCC 120 

ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 18 0 

ATTAGTAGAT ATGTTTGGCC GAGAAACACC AGTAGAAGTT GAATTCGATC AAATAGAAAA 24 0 

GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 30 0 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 3 60 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 480 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCASCA GGTGTGAACA TCATGGGATT 54 0 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGCAGGT TTAATTATTC CGGTAGAAAT 60 0 

CAGTGTTTAT GAAGATCGTT CATTTACATT TAT T AC a AAA ACTCCACCGG CTCCaGTATT 660 

ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720 

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 78 0 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 84 0 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 90 0 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA C TAG AT T AAT GATTGATTTA 9 60 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 102 0 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 108 0 

T C AAGAAG C A GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTCETGAAG AAGCAATTAA 114 0 

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 12 00 
AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 1260

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 132 0 

AGCAGCAGGT GCTGACTATG T 1341 
(2) INFORMATION FOR SEQ ID NO: 332:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 120 

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 18 0 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 2 40 

TTTTTGTTCA CGCAATTGTC TTAAGTCTTT AATAATATCG TCAGCTTCTT TTGTTGCAGC 300 

CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTT CATAATTTTG 360 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 4 80 

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AAT AT T C AAA 54 0 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 

AAC T TAT AC G TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 

CCAATTTTTC TAACATGATC TAAAATGCE ATTGCTAATG CAGCACCTTC ACTCGGATCT 78 0 

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 84 0 

TCAACTATAT TCGTCATATG AG AT GAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900 

TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGOTGGGA 960 

AT C AAC AAT C CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 108 0 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 114 0 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 12 60 
GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACEATG ATGGCTCAAT ATACAATGTT 1380 

TGTCCTGAAG C AG AT T GAT C ATGTACAATC C CAT T AAAAT CTTGTCGATA TTCAGCTTTG 14 4 0 

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CAT C T G AT AA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 15 60 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAG AT C A G T T AAC AC AG GTAATTGATT CATCTTGTCA 168 0 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 17 4 0 

AATTGATTTT GTACTTGAVT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 18 60 

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 192 0 

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAGCCTAA GTCACTAATA 198 0 

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 204 0 

CTCAT AAAAT CCCTCTATTT CTTAAGTTTT GTATTATTTT CAATAAAGTC TTTAAAAGCT 2100 

TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 2160 

CGCCATAATT CAT AAAAT C T AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220 

TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280 

TAATTTCTAA TACTGTATTC GTTTCTTCAG CTAATGCCAT TAATTGTTCA ATATTCGGTT 234 0 

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 24 00 

GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 24 60 

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 2520 

AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCACTATATTCTT 2580 

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2 64 0 

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2700 

TTGCTTCTAC CATGTCTCGA AT AG AAAAC G CACCATCACT ATACGTTGTA TGCATATGAA 27 60 

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 28 2 0 

CATCTTCTCG CATAGCAGGT GGT AT AAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 288 0 

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT T C CAT AT T C A CTTACTTTTT 2 94 0 

CATCACGTGC TTTAGCAAGT TGTCGAATTC T'ATATTATG TTCTTTTGAC CCAGT AAAAT 3000 

GCTGCAATGT AT GAT AAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 30 60 

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120 
TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTAOA 3180

AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 32 4 0 

CGATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3300 

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360 

CCGTTTTCTT AGCAAA.TCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 34 20 

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 34 8 0 

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 354 0 

AAT AC T GAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 3600 

TGCCTTTt AA CTCCGTt ACA T C AG AT AT T T CATCTAATGG TCGTTCATCT AATTCAAGAC 3660 

TTTGAGCTGC TTTTCGaTAC GCTGaTATTT T AAAAGT AT T TTCCCCTTTT AATTCCATAT 3720 

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3780

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 384 0 

ATAAATTAAT GGTGAAAGCC GATAAATATG GGATGTGAAA AAT C AAT G AA TGACTCACAA 3 900 

TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3960 

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 4 020 

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4 08 0 

TGAACTTACA AAACAATGTA AT CAT T AAAA AAGC T AC AAT CGCTTCAAAT CGATTTTGTA 414 0 

GAT GAT T AAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTBGA TATGGAATAA 42 00 

AT AC AAT T AA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AAT G AT AC AA 4 2 60 

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTGAATCCA ACGATGACAA 4 32 0 

AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CGCTGCTGCA 4 38 0 

ATTTGTGAAT TTGTTGTTTC AAACGTCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4 44 0 

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 4 50 0 

CCTAATTCTT TTATTTTATC AT C AAC T AAA TGTGCTACAT ATCGTATGTG CTCTGGGTTA 4 5 60 

TCTTCCCCAA CAATTGTAAA AAGCTGATGT TTAATTGATA CATTTACCTT GTTTTTAAAC 4 620 

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4 68 0 

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4 74 0 

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTftAGAAT 4 800 

TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4 8 60 

TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4 920 
GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4980

TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 5040

CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACGTG 5100

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 5136



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

GGCCAAAATT GCACCTCCAC TTTCCTTTGA iCAAATCTAT TTTAAACGCA CATTCCCATT 60 

AT G AC AAAT T AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 

GTTTTACCAG ATTTATTTTT TATGAATATT ATAAAATAAA CTTACATCAT GTTGATAGTC 180 

GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTAT 24 0 

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAAAAA 300 

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTACATAT 3 60 

AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 4 20 

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 480 

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 54 0 

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 60 0 

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 6 60 

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720 

CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780 

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AAT GAT AT C A TCTACACGGT 840 

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900 

CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 960 

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 102 0 

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 108 0 

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 114 0 

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 12 00 

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGBTCA TCATCGCCAA 12 60 






ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320 

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380 

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 14 4 0 

C TAG AC T AAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500 

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560 

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620 

GCAGgTTAGC AGCATTTTCA AACTCTTCAG CATGTACTGC GGCATCTTTT TCATTTTTAA 1680 

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 17 4 0 

TTACTTTAGA ACTTGCTTCA TCAAT TAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800 

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATAATAC 18 60 

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 192 0 

AAGGTTGATC AACTTGTACA GGTTGGAAAC GACGTTCTAA AGCCGCGTCT TTTTCAATAT 198 0 

TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 204 0 

CCGGCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100 

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 2160 

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 2220 

C T AAAG AC AT AACACGCTTA TCTTTTAATG TCTCTSGTAC CTCATTATTC ACTAtGGCtT 22 80 

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 234 0 

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 2400 

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 24 60 

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2520 

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 258 0 

CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2 64 0 

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 27 00 

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 27 60 

CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2820 

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TffiATTTGAA TGATTTAAAC 2880 

GAATTGCTTC TTCTTGGGCA TGTGCTAATA cGCGCTGTGC ACGCTCAGTT AATCTACCAA 294 0 

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3000 

T T T AC AG AT T TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 30 60 
TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180 

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 324 0 

GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300 

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3360 

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 34 2 0 

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCTCTCATACCAG 34 8 0 

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 354 0 

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3600 

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3660 

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3720 

AATGCTTnGC AACCATTTyC ATTTTACTTk GTkGATCCaT TkGATCaAGA CGCATTAAtT 3780 

CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3840 

GCATACATTA GTGGATGCAC ATGATTTTCT AAtTACGCG CTAACCGAAT TCTAGAAGAC 3900 

ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC AT G AAT AT T A 3 9 60 

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4 02 0 

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTTTC 4 08 0 

GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 4140 

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 4200 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4239 
(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CAAAACTTTG ATTATGTCGT GTCAGTTATT AAT GC AT ACA GTAATGCATC GTTGTTATTT 60 

TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 12 0 

CTGATTTTCT AAATATAAAA TGCCTCCTAA T AAC AT AC T A TTAGTACATC ATTAAGAGGC 180 

TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTT^AATTT AAGATGAAGA TTCTTGCAAT 24 0 

AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 300 
ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 360

ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 420 

GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 4 80 

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 54 0 

AACCCACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTTCAATTG TTCGCTATAT 600 

ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 6 60 

TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 720 

CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 7 80 

AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA BCATTTTTC GAGCACACGA 84 0 

TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900 

TGTCTTACTT TATTAAAATA ATCACATTGT ATAGCTGTAT TAGTCCAGCT ACAATATCTT 960 

CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080 

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 114 0 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTG^CACACT CATACATTGT TACAC 12 4 5 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 60 

GCAACTTCAA TTCGACAGGC TCATCTTGAT AC AT GAAT GC CTCAATTTCA TCATGTGATA 12 0 

ACTTTTTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAA2SACATA TCCCAATTTG 180 

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 240 

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 300 

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 3 60 

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 4 20 

TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 4 80 
GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 540

CCTGCTGTAC GTGTGTACCG ACAATiACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600 

CAACTGCACC GGCGTTTTTA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 6 60 

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 7 20 

CTAATATTTG TATAGCATGT T GAT GAT ATT TACTTTGTTT AATATCATTT (TACCAGGAG 78 0 

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCGTAT 8 40 

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 900 

CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 9 60 

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 1020 

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 10 80 

ATGCAAACTC AATCGGCTCT GTAACACCAA CGAAAAATGC TGCAACTGAA CTCGCTAGGA 114 0 

ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTAGCTGT AT GAT AC ATT GcTAACGCAG 1200 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 12 60 

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCZ& 1380 

CACCATATAT GAAAGCTCC 1399
(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60 

CGTATTAAAT TGAATGGTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 120 

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180 

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 240 

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300 

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCT/TTGC ATAAAATTGT ATCCATTCTC 360 

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 4 20 

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 4 80 

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 54 0 
GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600

AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 660 

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 720 

TGGTTTGATT CACAGCATGA ACATTTTTTT CAT T AAAT T T GTTAAGTTCT AAATTTTCAA 7 80 

T T AAT T TAG A AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 84 0 

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 900 

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AACATAATT GTCTGCTCGT 960 

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 1020 

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 1080 

AT AAC AAAT G ATAATCAATT TGTTTTCTCG GAAATATTTT GCTTTTTTAA AATGGAGTAG 114 0 

TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAG C AC G A CCGAAGTATC 12 60 

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320 

ATTTTTACG 1329
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60 

T T C AT AT AGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 120 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATSTAG AGAACCAAAA 18 0 

TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT G AC AT T AAT A ACAGTACACA 24 0 

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 300 

AGCCGTTACT TCAGCCATAC CTGTAATAAT C C AAC AG AAC CAGTATGTCC ATCCGGTAAC 3 60 

AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT T AAAT T C AG T 4 20 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 4 80 

GATGTATGTT AATAGAATTG AAGGGCCGGT T AAT GC AAT C GTTTGACCAG CACCTAAGAA 54 0 

TAAGCCTGTA CCAATTGCAC CGCCAATT3C AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 
CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 660

TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720 

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGSAAATTC 780 

ATGGATTATT C AC AT C T T G A AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 84 0 

T T AC AT AG AT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 900 

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AG AC T AAAAT 102 0 

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 108 0 

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 114 0 

AGAGCTTGCT CAT C T AAAAG GGAAATTAAA TCAATaCATG ACATTAAATC CTCGTGTGTA 1200 

TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TACGTACGTC 1260 

ATTATCATAT ATTGCACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 1320 

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 1380 

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 14 4 0 

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500 

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 15 60 

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 1620 

CATGGTGGTG CAAACGAACA AG T T AT G AC G ATGTTATCTG aGATTGGGTC Aa T T G AAAAT 168 0 

GTTGATGCTT AC T TAG AT GA AAAATTTGCT AATAAAGrTA AAGTAATGGG cTTCGGTCAT 174 0 

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GSAAaTGAG CCGTCAAaTT 1800 

mCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 18 60 

G C AG AAG AAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGCGAGTGT TTATCACTGT 192 0 

ATGGAAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 198 0 

ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 204 0 

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA G AAAitiT AAT C AATACAAATT 210 0 

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AAT T AC T C AA 2160 

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT CGGTGATGGA 2220 

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGCTGCtGT TGAGAAAsCC 22 8 0 

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 234 0 

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 2400

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TGTGGCTTTA 24 60 
CGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580 

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 2 64 0 

TTACAAAACG AAATGGGTGC G AC AAAC AT T CGATTCCCAG AAACTTCAGG TATTGGTATT 27 00 

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 27 60 

GATAATAACC GTAAATCAGT TACTTTAGTT (ATAAAGGTA ATATTATGAA ATT T AC AG AA 2820 

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 28 80 

ACTTGGCAAC AATATGACGA AATTGTTGAA AAT GAAGGCA GAGATGCTGC TAATGCTGCT 2 94 0 

CAAGAAAAAG CTGAAAAAGA AGGCAAGATT ATCATTAAAG ATTCTATTGC TGACATTTC 3000 

T T AC AAC AAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAGCAACTAT GAACTTGAAT 30 60 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACAT T A ATTATGAAAC AGGTCATGCT ATTTTTGAAG C AAC AC AT G G TACAGCTCCA 3180 

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 324 0 

T TAG AAC AT T TAGGATGGCA AGAAGCGGCA GATAAGATTA C AG AT T C AAT TGAAGATACA 33 00 

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 33 60 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTA AATAAGCAGA AT AG AAT TAG 34 2 0 

G 3421 
(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 12 0 

TTAATTGGAT TAAtACGCCA AAT AC TAG T A ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GAT AAT TGAA 240 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300 

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 3 60 

T AAAC AAC T T GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 4 20 
TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 480

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG &TACGTTAA GTCATCGGAC 54 0 

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 60 0 

TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660 

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 7 80 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 84 0 

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 

TCCGGTTGCC CATTTTCATG AAAAAAACCT TT ATT AT AAA ACCTTTGTAC CGTCTGTTCT 960 

ATTTTGTCAA AC T C AC AAAT CGTTTCAGCA CCACTAATAT GAACATCTTG ATTGCTAGAA 102 0 

CGCATTTTTA TACTATACAT G AC GAT C AC C TcAATCTTCT TGaTGCAAAA TTTCAAACAA 1080 

CCTATCTATA TCTTGTTCAG TATGAAAATA CGACAATGAT ATTCTTAAE TTGGCTTAGT 114 0 

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 1200 

GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 12 60 

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 1320 

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 138 0 

ATAAATTGGC AAACTACTTG AG T AGAT GAG TGGTCTACCG TGATTAATTA aCATATCCtT 14 4 0 

CaCATCaTTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 1500 

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTGAAAC 15 60 

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTGAC CAATATCCAC 1620 

AACGTCGCCA TTCGTTGAAA ATACACTATC AGATATGATT ATTTTTGGTA TATTTTGATT 168 0 

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCfiTTT 17 4 0 

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 1800 

C AC G AC AC AA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 18 60 

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 1920 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 1980 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2 04 0 

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 2100 

TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTJVT AAATCAATTT GCTCTTGAAA 2160 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 2220 

TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 228 0 
AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 2340

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 2400 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 2 4 60 

AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2520 

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGCCAG AACATGGTGA TATATTATTC 2 58 0 

GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2 64 0 

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 2700 

AGTTTCTTAG CTTGTTCGTC ATTCGTTAAT CCTAAACATG CACAfiTTTT TAGTTGCGGG 2760 

TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2880 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2 94 0 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT T AAAAT CAT G 3000 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3060 
TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120

tCCCCTTGTA ATATGCGTTT AGCCAASTTC ATATTAACAC TTCCTATCTA AAA 3173
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

C GAT TAT CCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60 

C AAAAGCC T A AATAAATTAC AGCAAT T TAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAA TGCATTTAGA 180 

GATAATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATATCTCTT T AC AC T G T C A TCGACAAAAA CTAAATCTTT 30 0 

CACTTTCAAT TTCGAACGTG GTTCTAC GAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 3 60 

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 4 20 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 480 

AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 54 0 
TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 6 60 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720 

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCGGA 780 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA T T AC AC GAT T ATATTGTGAA 840 

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900 

ATAATTGATT TAACCTCTTc GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 960 

GTACGCCCAG CTTCAGcTAA GTCTACTTTC AAAGTTACAT CCACTTTGAA AATTTGCCCT 102 0 

ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 1080 

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 114 0 

CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTC ATAATACCAT ATGCAGTCGT 12 00 

AGCTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 12 60 

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 1320 

TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 138 0 

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 14 4 0 

TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 1500 

AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 15 60 

tGATCATTAA TCATATCAPC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 1620 

GTATCGACTG AAATTTTTTA CATCAnAACC GACAATAGCT TCAnCAACAG TAATACTCTG 1680 

TTCCATCTCT TCTG 1694 
(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 12 0 

TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 18 0 

CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC T GC AT AT T T A 24 0 

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 300 
AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 360

AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT TGACTTTGTT 420 

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 480 

GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 54 0 

ACGCGTTGTT TTTGACCTCC AG AT AAAT C T TGCGGTCGGT GATTTAAATG TTCTATCATG 600 

CCAACTTTTT CAGCCCAATA ACTTACATTT TTCTTCATAT CATCTTffiTC AACACAATTA 660 

TTTTCTAATA AAAAGGCCAT ATCTTCAGCT GCTGTTAAGC CTATGAACTG TCCATCTGTA 720 

TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCG 84 0 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG CACCAACTAC TAATACTTTT 900 

TCTCCTGGAT AAAT AT C AAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAffi TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 114 0 

ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 12 00 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTGACC 12 60 

GATATTAAAA T AAAC CAT C G ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

TCCAATCCAG C CAT AT AAAC AGCCTATAAT TCCCGAGC 1358 
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 12 0 

C AC AT AT G AC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 24 0 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC KCGCAATTA GTGTTTGTCA TT AC AG AT AT 300 

GTCATTAATT ATTGATCATG TCATTTTAGA AT AT G T AAAC CAAGATTTAT CAGAATATGG 360 
TATTTCATTA ATCTTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 420

TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTFCA 4 80 

ATTGAAATTT AC AC C T G AAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 54 0 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600 

AGAGATG TAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT G G AG AC AAAA 6 60 

CGAAACATAC AA?ACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 720 

ATTGAACTTA CAT G AAAa AG CACaCgGGCC aCATGGtTTA GTTGctGGtA CCACTGGTTC 7 80 

AGGGAAATCT GAGATTATCC AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 840 

TGAAGTTGCA TTCCTATTGA TTGACTATAA AGGTGGGGCr ATGGCGAACT TATTTAAAGA 900 

TTTAGTCCAT TTAGTTGGTA C GAT T AC AAA CTTAGATGGC GAT GAAGCGA TGCGTGCCTT 960 

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 1020 

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 108 0 

TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATTTTATGAA 1140 

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 1200 

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTAAGTTGGC 12 60 

ATTAAAAGTA CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAAC AC C AG ATGCAGCAGA 1320 

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATGAATTATt 138 0 

CCAATCTGCA TGGAGTGGTG C AACAT AT GA CATCGAAGGC GATAAATTAG AAGTTGAAGA 14 4 0 

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATBACA AAGACTTGAG 1500 

TGGACTTGAA GAT GAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 15 60 

TATCGAATCT ATTACAACAC GAT T AGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC 162 0 

ATTGCCAGAA AATGTATATC ArGAAGATTT AG T AG AAAC a GATTTCAGAA AATTATGGTC 1680 

A GAT GAT GC A AAAGAAGTGG AATTAACATT AG G AC T T AAA GACGTACCAG AAGAACAATA 1740 

TCAAGGACCG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGTTAA TCGGAAGTCC 1800 

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 18 60 

TCCTGATCAA GCACACATGT ACTTGTTGSA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 1920 

AGACATACCA CATGTCGCTG ATTACTTTAC AG T AG AT C AA GAAGACAAGA TTGCTAAGGC 1980 

GAT AC GT AT A TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 2040 

CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTATTCT 2100 

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 2160 

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 2220 
AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 2280

AT AT GAT AAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 2340 

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 24 00 

TAAACATGAT GAG AC C AAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAC 24 60 

AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2520 

GAAGATtACA GAGAaTCATt AAACTTACCA GATATAGTTG CAAATGGTGC TTTACCAATT 2580 

GGATTAGATT ATGAAGGTGT T AC AC T AC AA AAAATTAAAT TAACTGAACC AGCAATGATT 2 64 0 

T CAT C AG AAA ATCCGAGAGA AATTGCGCAT ATTGCTGAAA TTATGATGAA AG AAAT T G AC 2 700 

AT AT T AAAT G AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT TAAAGCTTAT 27 60 

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 28 2 0 

ATGATTGAAG ACTTAAAGCA AAG AG AAAT G GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

ATTATCAATG ATTTTA2AAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 2 94 0 

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3000 

GAATTAAtAG ATGCTTATGA T AAAC AG AT T GATGTTGCAC GTAAAATGAT TAACCAATTT 3060 

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TB.GATTTAT T C AAC GAG AA 312 0 

CCTGTTATTA AAGAAAAT G A AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

TGGTTTAAAT AGCAAT GAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 32 4 0 

AAACAATGGT T AAG T C G AAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3300 

AAAT T G AAG T TCGTTCTGGA TAT T T AAG AA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3360 

ATTTGAGTTT TGCACTAGAT GAT AT T G AAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 34 20 

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 34 80 

AAAAAAGAAA TTTGATGAAA CAAffiGCTTG AAATGTTGGG T G AAG AT AT C GATAAAAATA 354 0 

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3 60 0 

AAGCAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 3660 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3720

GTCAAAAGCA AAC GC AAC AG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 37 8 0 

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 38 4 0 

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3 900 

AT AC ATT AT C AATTAAGCAA GGGCTTGATA GGGTGAACCC AT GAT G AAAG ATGTTAAGCG 3 9 60 

AATAGATTAT TTTTCTTACG AAG AAT T AAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4 020 
AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080

GGAATTAGTA ACAGAGAATG ACAAGTTAAC 'EATGCAGGT TTTAAAGTGG CTACATTAGT 414 0 

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GAT AT G T AT T TTGCACCATT 4200 

TAGCTATGAA AAA GAT G AAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4 2 60 

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATGAT 4 320 

GAGACAAGAA AAGTCCAATG ATTGGGACTT TAAACAAATT G AC GAT G AAA CATTGGAGAA 4 38 0 

CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAAT T G AAA T AT AC AAT C A 44 4 0 

TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 4 500 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4557
(2) INFORMATION FOR SEQ ID NO: 342:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAAT AT T C T TCGAAGTATT TGATGTTTTA 60 

CAAGATTTAG GAT T AAC AAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120 

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGOAAGTG AAAAAGTTAG AATTCACACG 18 0 

C C AT C AC AAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 24 0 

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 300 

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 360 

GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG a AAAT GAAG T AGAGTCAACA 4 20 

GAT G AAG AAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA T CAT G a AC AT 4 80 

GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 54 0 

TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 600 

GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 660 

TAAACCGAAT GATTGACCAC GGAAT GAAT A ATGTTGAATT TATCGCTATC AAC AC AG AC G 72 0 

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA ATCGGTGAA AAAT T AAC AC 780 

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 84 0 

AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 900 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGGCGCAT 960
TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 1020

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 108 0 

ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 114 0 

ACAACGTGTT ACGCCAAGGT GTATAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 1200 

T AAAC T TAG A CTTTGCAGAC GTTAAGACAA TTATGTCTAA CCAAGGTTCT GCAT TAATGG 1260 

GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320 

CTCCATTACT T G AAAC AT C T ATCGTTGGTG CACAAGGTGT GCTTATGA21 ATTACTGGTG 1380 

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 14 4 0 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GAT GAG AT T G 150 0 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560 

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 162 0 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 168 0 

CAACTAAAGA AGATGATATT CCTAGCTTCA T T AGAAAT AG AGAAGAAAGA CGTTCAAGAA 174 0 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT T C AAC AC AAA T CAT C AG AT A 18 00 

AC AT AT C T G A TGATTTTTTT ACTAATTTTT AG a AC AT G T A GAAGGACATT TAAGTTTTTC 18 60 

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 192 0 

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTASCGA 198 0 

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 204 0 

CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100 

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220 

AAAAATTGGG TGTTTCCCAT T C AAAC AC AT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 22 8 0 

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 234 0 

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATG&G ACTGTGTACC AGTATATTTT 2 4 00 

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 24 60 

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 25 80 

ACATTGCCAA TTGATAGTGC CAACTATATT GAAAC TAG AG GACGAGATCG TCATGGTATT 2 64 0 

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 27 0 0 

TATACGACAG CGTATGCTAC AT C T G AAC AT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 27 60 
AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940

GAGCTAAAGA AGCGTATGAG GCTGGAATAA GACATTTTGG TGAGATAGA TTGGAAGGCT 3000

TTTTTCAAAA GAAAGAAGCA TTACCATCAG ATGCGGTGAT CCATTTTATA GGATCATTAC 3060

AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180

TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT GATCAGTTTA 3240

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3300

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360

AAATACAACG ACTCAATTTA GAATATCEGC CTTGTGATGA ATTATCAATG GGAATGAGTA 3420

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 3480

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3540

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AfiCGCCAGC 3600

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3660

TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3720

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3780

AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 3840

TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3900

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3931
(2) INFORMATION FOR SEQ ID NO: 343:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TAT G CAT TAT TAATCGCTAT 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGATAGT 24 0 

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 300 
TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360

GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 4 20 

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 4 80 

AAGAACACTC AATAATAAGA AAGTTCGAGT TGCTGTTCCA AAACTAAAGC AACGTTATAC 54 0 

AGGTATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA TTTATTGCAG CAATTGTAAT 600 

TGGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCffCA ACTGGAAGAA ATGACGGTTT 660 

AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720 

TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 7 80 

AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 840 

TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900 

TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 960 

TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020 

AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080 

TGATCAAATA AAAGTGATTG G AAAAG GAGA AATAATTATG AT AC AC G AAT TAGGTACAGT 114 0 

AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGC AAAAG AAAATGGCAA CATTGCAATC 12 00 

TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGlSAGCCA TTCCAAATTG 12 60 

GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 1320 

AAT T AC AAT T CAAAAAGTTT AACGTTATCA TTTTAACAAT AAAATAGATA TTAGATTCTA 1380 

TGGCTACTTC CGCTAATTTA AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 14 4 0 

CATTTTTATA TAGTGTGGCA CATTTTATTC C AAAAG AT G T AAT AAAAC T T AACGCATTTT 1500 

TGCTTTTTAT AAATTGTCAG ATT AT TAT GA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 15 60 

CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 162 0 

TTTATTTTAT GGTGGGTTAG TTCAAXTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 168 0 

GGCAGCAATT GTGCTTGTTA CAT T T GT AT G GATAACAGTT GGTTTTACAA TTAGTTTTGG 17 4 0 

GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 1800 

GACTCAAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TCAAATGAT 18 60 

GTTTTGTACG ATTGCAATTT CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 1920 

TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 198 0 

GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 2040 

TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG G AAAAG G AAA 2100 
CAAACATTCT GAATCAACAC CACATAATCT TATCATTACG TTGATTGGCG GTATATTCGT 2160

GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGCGAT 222 0 

GCTTGCATTT ACAAATACTG TCATTTCAGC CACTGCAGGT GCTATAGGTT GGTTAATTTT 22 8 0 

AGAATATATT TTTAAAAAGA CGACAAGTTT ACTTGGACTT TTACTCGGTG CATTAGCAGG 234 0 

ATTAGTTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 24 00 

TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAAff 24 60 

TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2 52 0 

AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT T T AT T TAT AC 2 58 0 

TGGTGACATA CAT AT TAT AC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 2 64 0 

TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 27 00 

GGAACAAGAA ACGAATATAG GAT T AG AC AA GATTGTTCaC GGTGAACATG CTTACTTTGA 27 60 

AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 2820 

TCATTGTCCT G AC AT T T AAC TAAAGGTTGA TGTTGGGACATTTTGTTATA CAAAAGTTTT 2880 

ATTTTGAAAT CTTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 2 94 0 

AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTGAATAA GTATGAAAAA TATATCTGAT 3000 

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 3060

GTCAGTAAAA AAACAAGTGA AAAATTAACA AGAATTATAG C AG AAC AT G A CTATCAACCG 312 0 

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 3150 
(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 

ATCATGTTAA AGAAGCAOAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 12 0 

CTAGGATGAA AGCCAAACAA AAAGTTAACA CAT T AAGC AA ACCGCATCAA AACTATTTCA 18 0 

ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 24 0 

TAT AC AT C T C GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATE.TAAATA ATCTTTTCAA 300 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 3 60 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 4 20 
TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 480

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 54 0 

AAAAAGAAAG AGGTGTTAGT TATGACAGAA TACTTATTAA GTGCTGGCAT ATGTATGGCA 600 

ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 660 

AAGAGGTTTT TCTATTTCGC TACTASTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720 

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 7 80 

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 84 0 

AT T AAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAG G T C AAC CAAT G AC AT T C 900 

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 9 60 

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 1020 

GAGTTTGACA AAGGT CAAAG AAC T GAT AAA TATGGACGTG GCTTAGCGTA TATTTATGCT 1080 

GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 114 0 

TATAAACCTA ACAATACACA T G AAC AAC T T TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

GAGAAATTAA ATATTTGGAG C G AAG AC AAC GCTGATTCAG GTCAATAATG CTCATTGTAA 12 60 

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTAGA T C AC GAT AT G ATTTATTATC 1320 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 138 0 

AAATTTTAAA TTTAGTATTA AAATGGAATG TT AC TAT AT A GTTCAATGTG TATTATCACA 14 4 0 

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAASA 15 00 

AG AG C G T G AA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 15 60 

TGTTACAAAA TATTTAGTGC AAAAG C AC G A cGGAGGTATT CAATATGaAT AACGGTACAG 1620 

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 168 0 

TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA AT CAT T AG AA GAAGGCCAAA 174 0 

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 18 00 

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC 18 60 

GATAACAGTT GGTGTGGTAA AAGCAC TAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920 

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 198 0 

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 204 0 

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2220 
TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2280

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 2 34 0 

GGGTCTACTG AAACAAGTAA AGAATGATCG ATGTTACTAA TATTGCCATT CTCCAAATTT 2400 

ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 2 4 60 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCA TTCACTCATG 252 0 

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580 

TCAACCAAAA TTTTGTTCGA AT T C AAT AAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 2 64 0 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 2700 

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2 7 60 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 2820 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 2880 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 2 94 0 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 3000 

TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060 

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCGTTCAA 3120 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG TTTTTCTTAA CCCATTCAGC 3180 

CGCTTTTTTA CCTGCACCAG GATAGCTGCA AC GATAAACT TTCATCCAAT CATTTTTGCC 32 4 0 

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 3300 

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA C AAT G AC C AA AAAATCscCA 3360 

CATGACAGGG CCTTTTGTAA C AAT AAT AT C ACCAGGTTGT AATTGGAATT TGTCATCTTG 34 20 

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 34 80 

G AAT AAT AT A AGTAAAATTA TCGTTCGTTT AATATiETTC ACTTAAAAGC TCCTTGTTGA 354 0 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3600 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA T nAAAAAC AG 3660 

TCTGAAACAT C AT T GAG AC G TTCCAGACTG GATATAAAAT G AAT T T CAT T TATAGCACA 3719 
(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60

ATAATTCGCT TTTTCTTCAA TCTTCTGTTC GATAATAAGC GAATTCGTTT CAGCAGTTGC 120 

CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 

ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 24 0 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 300 

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 360 

TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAAH TAT TGAAAAT AAATCTTTTT 4 20 

ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 480 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 54 0 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 

TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 660 

T G AC G TAT AG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 72 0 

AT TATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 780 

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 84 0 

ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 

ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 9 60 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA T CAT CAT CAA TAACTGTTC ACCAACTTCT 1020 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATT C AC AC C G 108 0 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 114 0 

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200 

CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 12 60 

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG C T AC AC CAT T AAAATAACTA 1320 

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 138 0 

CCGTAAAATT TAACATCTAA AACATCCA^T GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 14 4 0 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 1500 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 15 60 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTGCAGTT 162 0 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 167 6 

(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1294 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

TACAGTAGGA AT C AT AAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 60 

TGTGTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120 

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180 

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 24 0 

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 300 

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTTG 360 

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 4 20 

TGCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 4 80 

TTTTCCGCGT GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGTAT 54 0 

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600 

GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 6 60 

TAATTTTGTT CGAGAGACGT GGGTACCTTC C AC AAAAAG T AGTGGCGCGA TAACGGCAAA 720 

CATAAATACT TGAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780 

ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 84 0 

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 900 

CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGC ATAATGACAT AGATATATTG 960 

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020 

TGTTTAAATT ATTGATTGTc AT AT C T G AAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 1080 

GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 114 0 

AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 1200 

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 12 60 

CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 1294 
(2) INFORMATION FOR SEQ ID NO: 347:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347:

AC AT G AT AAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60 

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA T AAT AT C AC A AAAACAGAGA 12 0 

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 18 0 

TAAAAATAAT GTGTGTACAA AGTCATTAAT T TAG C AAAT A TTTTTATT T A GTAGTTAATA 24 0 

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 3 60 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 420

AAGAATCAnA ATTATGTTTA nATAAATATA TAT AC T AC T T TGAAAGGTGT GAGCTTAATG 4 80 

ACAACTTTTA G T G AAAAAG A AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 54 0 

GAAAATAATA AT G AAAT AGA CGTTTGTAAT TAT TT AAC AG ATTTATTCGA CAAGTACGAT 600 

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660 

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 72 0 

GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 780 

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAT CGAATTAAAA 840 

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGC T AT T TAGACGATGT CGATGGCTTA 9 60 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080 

ATTGATACAC TGCTTGAATT TTATAATCTA T T T AAAG AAA AAT AT T C AG A GCTTAAACAA 114 0 

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 12 00 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260 

GGCGGcAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 138 0 

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCSTAAA 144 0 

AATAGCAAAT TAATTACTAC GAT T AAAG AT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AG G C G C AAC A GATGCCTCTA GTTTCTTAGG AGATAATAAG 15 60 

GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 16 80 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 17 4 0 
TTTTTATTGC GAATTTTAAG TTGATAGAAA TAGCGATGAA TATTGTTTGA ATGTAAATAT 1800

TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGffGT AAAACTGAAA TTATTGTGCT 18 60 

GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA AT AG T nAC AT 1920 

AAACTCGTAA AAnCC 1935 
(2) INFORMATION FOR SEQ ID NO: 348:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTASA 60 

AnATTTATTT TAATTAAT GA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 12 0 

ACGCATACGA AAG AT AC AC T TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 24 0 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 300 

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 3 60 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 4 20 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 4 80 

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 54 0 

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CAT T AAAGT A 60 0 

ACAACACCTT AC GAT T T AAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 6 60 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 72 0 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780 

AAGAT AT TAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 84 0 

AATTCATGAA GGTGTTGGTG ffiGTCGTATT T G AC AG T AAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 960 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 102 0 

GC T AAAT CAT GATAGAGCTG TACCACTACC TGATGATATT GATTTABTA TTATTTCATA 10 8 0 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 114 0 

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200 

ATTACGTAAA TTATATCCAG AGTCTAAAAT ATATGTCTTT GGTAAAACAG ATTATAAATT 12 60 
GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320
ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 1351 
(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60 

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120 

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 18 0 

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 24 0 

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTTTTATCAGTAT 300 

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360 

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1639 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60 

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 12 0 

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GAT GnAT AT G CATAAACTGA 18 0 

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 24 0 

TCATTGCTAT TAT T AAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300 

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 3 60 

ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 42 0 

CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAlCCAATk 4 80 

ATTtCCAaCA ATTAGTCAAT GAtCwACCAC AGaCACTGCA ATCATTTTTc AAGwAAATAT 54 0 
TTAATAATAA AGGAGCGTTA TAGTGAATAT CATCTCAACA ATTtTAATCA TATTTGTGGC 600

AT TAG AG T T T TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGAC TAG 660 

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 720 

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 7 80 

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 840 

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCffiACAA GGCACATTGC CAGTATTGGC 900 

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 960 

TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 1020 

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTTAATGAA 108 0 

G AAAAC C CAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 114 0 

TTTATCTATT GCATCAATCC T T T AAAAC AA AAAACCCATG ATTTCGAAAT TCCCGTATGA 1200 

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 1260

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG AC AT G AG AAC TTCGGTGTTG 1320 

ATTGGCATTA CATAACGCTT C C AAAC AT AT TATTTGGTAA CAATAAGAAA CTATTTACAC 1380 

AATATATTTT GTATAGTAAA ATTATTTTAT AATATTTAAA TCCAATTGCA CAAGGAGTGA 14 4 0 

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAGAAGCA GCTATCATAT 1500 

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 1560 

ATCCATTTCT T AAAT T AC AT GAAAACaAAA AATCTTAATC ACTTTtATTT ATAGcATTTC 162 0 

TAATCTCAGA AATGCTATA 163 9 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TAT T G AAAAA 60 

CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 12 0 

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGAT GC TAG A 180 

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 24 0 

GGGACCGTAA CCGCAATACC CATACCTAGT AGAAACATAT TAGCAACAAA GAGTTTATAA 300 

TTCTyTATTT GTAaTAACGC TGCAAACATA TCCATAACCC CGCTTCTTAG AGCCCCTTTT 360 
AATTnATnAA TTAGGGGCTC TTATGCAGTT GGTGCATTAG CAACEACTG TATTCCTTTG 420

TCCCCTTTTA ATTTATTaAT TAGGGGCTCT TTTGCTGTTG GTGCATTAGC AACCAACTAC 4 80 

GTTCAATTTA ACCGAATAGT TTAAAATTAA ATACAAACCT T AAAT TAG T C T AAAAC T AC G 54 0 

CCTTTGGTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAACT TGTGGCAATT 600 

GCAACATGCT CGGATCATAA CTCATAAATG TCGAACGAAT CAGCGGTTCA TTATCAATTT 6 60 

CTACTTTTTC AAACTCAAAT TGTTCTTTGC TGATATTTTT CATCATAATT TCTGGCAAGA 720 

TTGTAACACC TACACCACTA ATCAACATTT CTTTGCAAGT TGCTACTTGA TCCACTGTAA 7 80 

TAGTTGCATG GTAATCTTGT TCTAAATfAT CGTTATACCA TTCTTTTATT T GAT T TAT AT 84 0 

AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACTTTGT AACATCATCT CGTCTATTTT 900 

TTGGAAAAAT AAAATAATGA TCATCATTAA ATAAATGTGT GTTAGCTAAA TTCATTACCT 9 60 

TATTTCCACG AGTTATCATA ACATGATAAT CTCTATGATT TGCTTTAATT TETCAGTTG 1020 

AACCAACTTG CACTTGTATT T C AAC AT TAG GAAATTGGGC ATTATATAGG CTCAAAACTT 10 8 0 

CAGGAAGTAA GGTTTGTCCA ATCAAAGAAG AACACCCGAT TGATATTGTT CCATTCACTT 1140 

CACCAATATG TGCCTGCATT TTGTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTCACGAG 12 00 

CATGCTCAAT AAT CATTGTT CCTTCAGTTG TTGTAATCAA TTGTTTTTTT GTTCTGATAA 12 60 

AAATATCTAC TCCAAAAGCA TTTTCAATAG CTTTTAGTCT TTGTGTAACA GCAGGTTGAG 1320 

AT AT AT AT AA AATTTCAGCC GCTTTACGTA ACGTTTTCGT TTCGTCTAAT GTTATTAGTA 1380 

AACGATAGTC TTCAATCTTC ATAATTTCCC CCCA'AAATT ATTCAATTAT TGAACTTTCA 14 4 0 

TGGCTACAAG CATTCATGAG TTCATTACTA ACGAATAATT TCACCAATTT TATTGGTATG 1500 

GCTGCAGCTT GAATTACTTA GTTTTTCTTT TGTTGTTGGT GATTTTTAQT TTGATTATAT 1560 

TGCTTAGGCT TTATTTGTTT GCTTTTTTCA ATATTAGTTT TATTTTGTGG CTTTTGATGA 1620 

TTTTTTTGAG CCTTTGCATT AATTTTATTA AAGCAGTACA TGATTTTCTT TTGGAATCCT 168 0 

T T AAAAT CAT TTTCTAACTC TGCCATAATT TGATGTGCAA TCATATATGC TTCATGAAAT 17 4 0 

TGCTTTTTTG TAATTTGCTC ACTTTCTAAT GCAAACATTA AATCATCTTC ATCTACCAAC 1800 

TCATaTcACC ACTTGG 1816 
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear